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METHODS OF DIAGNOSIS OF AN GIOGENESIS, COMPOSITIONS 
AND METHODS OF SCREENING FOR ANGIOGENESIS 
5 MODULATORS 



CROSS-REFERENCES TO RELATED APPLICATIONS 
This application claims priority to USSN 09/784,356, filed February 14 2001; 
10 USSN 09/791,390, filed February 22, 2001; USSN 60/285,475, filed April 19, 2001, USSN 
60/310,025, filed August 3, 2001, and USSN 60/334,244, filed November 29, 2001, each of 
which is herein incorporated by reference in its entirety. 



FIELD OF THE INVENTION 
15 The invention relates to the identification of nucleic acid and protein 

expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
angiogenesis; and to the use of such expression profiles and compositions in diagnosis and 
therapy of angiogenesis. The invention further relates to methods for identifying and using 
agents and/or targets that modulate angiogenesis. 

20 

BACKGROUND OF THE INVENTION 
Both vasculogenesis, the development of an interactive vascular system 
comprising arteries and veins, and angiogenesis, the generation of new blood vessels, play a 
role in embryonic development. In contrast, angiogenesis is limited in a normal adult to the 

25 placenta, ovary, endometrium and sites of wound healing. However, angiogenesis, or its 

absence, plays an important role in the maintenance of a variety of pathological states. Some 
of these states are characterized by neovascularization, e.g., cancer, diabetic retinopathy, 
glaucoma, and age related macular degeneration. Others, e.g. 9 stroke, infertility, heart 
disease, ulcers, and scleroderma, are diseases of angiogenic insufficiency. 

30 Angiogenesis has a number of stages (see, e.g. , Folkman, J.Natl Cancer Inst 

82:4-6, 1990; Firestein, J Clin Invest 103:3-4, 1999; Koch, Arthritis RheumAl:95 1-62, 1998; 
Carter, Oncologist 5(Suppl l):51-4, 2000; Browder et al., Cancer Res. 60:1878-86, 2000; and 
Zhu and Witte, Invest New Drugs 17:195-212, 1999). The early stages of angiogenesis 
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include endothelial cell protease production, migration of cells, and proliferation. The early 
stages also appear to require some growth factors, with VEGF, TGF-oc, angiostatin, and 
selected chemokines all putatively playing a role. Later stages of angiogenesis include 
population of the vessels with mural cells (pericytes or smooth muscle cells), basement 
5 membrane production, and the induction of vessel bed specializations. The final stages of 
vessel formation include what is known as "remodeling", wherein a forming vasculature 
becomes a stable, mature vessel bed. Thus, the process is highly dynamic, often requiring 
coordinated spatial and temporal waves of gene expression. 

Conversely, the complex process may be subject to disruption by interfering 

10 with one or more critical steps. Thus, the lack of understanding of the dynamics of 

angiogenesis prevents therapeutic intervention in serious diseases such as those indicated. It 
is an object of the invention to provide methods that can be used to screen compounds for the 
ability to modulate angiogenesis. Additionally, it is an object to provide molecular targets for 
therapeutic intervention in disease states which either have an undesirable excess or a deficit 

15 in angiogenesis. The present invention provides solutions to both. 



SUMMARY OF THE INVENTION 
The present invention provides compositions and methods for detecting or 
modulating angiogenesis associated sequences. 
20 In one aspect, the invention provides a method of detecting an angiogenesis- 

associated transcript in a cell in a patient, the method comprising contacting a biological 
sample from the patient with a polynucleotide that selectively hybridized to a sequence at 
least 80% identical to a sequence as shown in Tables 1-8. In one embodiment, the biological 
sample is a tissue sample. In another embodiment, the biological sample comprises isolated 
25 nucleic acids, which are often mRNA. 

In another embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 
Often, the polynucleotide comprises a sequence as shown in Tables 1-8. The polynucleotide 
can be labeled, for example, with a fluorescent label and can be immobilized on a solid 
30 surface. 

In other embodiments the patient is undergoing a therapeutic regimen to treat a 
disease associated with angiogenesis or the patient is suspected of having an angiogenesis- 
associated disorder. 
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In another aspect, the invention comprises an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-8. The nucleic acid molecule 
can be labeled, for example, with a fluorescent label, 

In other aspects, the invention provides an expression vector comprising an 
5 isolated nucleic acid molecule consisting of a polynucleotide sequence as shown in Tables 1- 
8 or a host cell comprising the expression vector. 

In another embodiment, the isolated nucleic acid molecule encodes a 
polypeptide having an amino acid sequence as shown in Table 8. 

In another aspect, the invention provides an isolated polypeptide which is 
10 encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-8. 
In one embodiment, the isolated polypeptide has an amino acid sequence as shown in Table 
8. 

In another embodiment, the invention provides an antibody that specifically 

binds a polypeptide that has an amino acid sequence as shown in Table 8 or which is encoded 
15 by a nucleotide sequence of Tables 1-8 . The antibody can be conjugated or fused to an 

effector component such as a fluorescent label, a toxin, or a radioisotope. In some 

embodiments, the antibody is an antibody fragment or a humanized antibody. 

In another aspect, the invention provides a method of detecting a cell 

undergoing angiogenesis in a biological sample from a patient, the method comprising 
20 contacting the biological sample with an antibody that specifically binds to a polypeptide that 

has an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide 

sequence of Tables 1-8 . In some embodiments, the antibody is further conjugated or fused to 

an effector component, for example, a fluorescent label. 

In another embodiment, the invention provides a method of detecting 
25 antibodies specific to angiogenesis in a patient, the method comprising contacting a 

biological sample from the patient with a polypeptide which is encoded by a nucleotide 

sequence of Tables 1-8. 

The invention also provides a method of identifying a compound that 

modulates the activity of an angiogenesis-associated polypeptide, the method comprising the 
30 steps of: (i) contacting the compound with a polypeptide that comprises at least 80% identity 

to an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide sequence 

of Tables 1-8; and (ii) detecting an increase or a decrease in the activity of the polypeptide. 

In one embodiment, the polypeptide has an amino acid sequence as shown in Table 8 or is a 
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polypeptide encoded by a nucleotide sequence of Tables 1-8. In another embodiment, the 
polypeptide is expressed in a cell. 

The invention also provides a method of identifying a compound that 
modulates angiogenesis, the method comprising steps of: (i) contacting the compound with a 
5 cell undergoing angiogenesis; and (ii) detecting an increase or a decrease in the expression of 
a polypeptide sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8. In one embodiment, the detecting step comprises 
hybridizing a nucleic acid sample from the cell with a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-8. In 

10 another embodiment, the method further comprises detecting an increase or decrease in the 
expression of a second sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8 . 

In another embodiment, the invention provides a method of inhibiting 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 

15 shown in Table 8 or which is 80% identical to a polypeptide encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
therapeutically effective amount of an inhibitor of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8 . In another embodiment, the inhibitor is an 

20 antibody. 

In other embodiments, the invention provides a method of activating 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 
shown in Table 8 or at least 80% identical to a polypeptide which is encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
25 therapeutically effective amount of an activator of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8. 

Other aspects of the invention will become apparent to the skilled artisan by 
the following description of the invention. 



30 



Tables 1-8 provide nucleotide sequence of genes that exhibit changes in 
expression levels as a function of time in tissue undergoing angiogenesis compared to tissue 
that is not. 
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DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and treatment of disorders associated with angiogenesis 
(sometimes referred to herein as angiogenesis disorders or AD), as well as methods for 
5 screening for compositions which modulate angiogenesis. By "disorder associated with 

angiogenesis" or "disease associated with angiogenesis" herein is meant a disease state which 
is marked by either an excess or a deficit of blood vessel development. Angiogenesis 
disorders asociated with increased angiogenesis include, but are not limited to, cancer and 
proliferative diabetic retinopathy. Pathological states for which it may be desirable to 

10 increase angiogenesis include stroke, heart disease, infertility, ulcers, wound healing, 

ischemia, and scleradoma. Solid tumors typically require angiogenesis to support or sustain 
growth, e.g., breast, colon, lung, brain, bladder, and prostate tumors. Other AD include, e.g., 
arthritis, inflammatory bowel disease, diabetis retinopathy, macular degeneration, 
atherosclerosis, and psoriasis. Also provided are methods for treating AD. 

15 Definitions 

The term "angiogenesis protein" or "angiogenesis polynucleotide" refers to 
nucleic acid and polypeptide polymorphic variants, alleles, mutants, and interspecies 
homologs that: (1) have an amino acid sequence that has greater than about 60% amino acid 
sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 

20 96%, 97%, 98% or 99% or greater amino acid sequence identity, preferably over a region of 
over a region of at least about 25, 50, 100, 200, 500, 1000, or more amino acids, to an 
angiogenesis protein sequence of Table 8; (2) bind to antibodies, e.g., polyclonal antibodies, 
raised against an immunogen comprising an amino acid sequence of Table 8, and 
conservatively modified variants thereof; (3) specifically hybridize under stringent 

25 hybridization conditions to an anti-sense strand corresponding to a nucleic acid sequence of 
Tables 1-8 and conservatively modified variants thereof; (4) have a nucleic acid sequence 
that has greater than about 95%, preferably greater than about 96%, 97%, 98%, 99%, or 
higher nucleotide sequence identity, preferably over a region of at least about 25, 50, 100, 
200, 500, 1000, or more nucleotides, to a sense sequence corresponding to one set out in 

30 Tables 1-8 . A polynucleotide or polypeptide sequence is typically from a mammal 

including, but not limited to, primate, e.g., human; rodent, e.g, rat, mouse, hamster; cow, pig, 
horse, sheep, or any mammal An "angiogenesis polypeptide" and an "angiogenesis 
polynucleotide," include both naturally occurring or recombinant. 
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A "full length" angiogenesis protein or nucleic acid refers to an agiogenesis 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type angiogenesis polynucleotide 
or polypeptide sequences. The "full length" may be prior to, or after, various stages of post- 
5 translation processing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of an angiogenic protein. Such samples include, 
but are not limited to, tissue isolated from primates, e.g., humans, or rodents, e.g., mice, and 
rats. Biological samples may also include sections of tissues such as biopsy and autopsy 
10 samples, and frozen sections taken for histologic purposes. A biological sample is typically 
obtained from a eukaryotic organism, most preferably a mammal such as a primate e.g., 
chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or a bird; 
reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
15 methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome histroy, will 
be particularly useful. 

20 The terms "identical" or percent "identity," in the context of two or more 

nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
same (i.e., about 70% identity, preferably 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or higher identity over a specified region (e.g., SEQ ID NOS:l-229), 

25 when compared and aligned for maximum correspondence over a comparison window or 
designated region) as measured using a BLAST or BLAST 2.0 sequence comparison 
algorithms with default parameters described below, or by manual alignment and visual 
inspection (see, e.g., NCBI web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such 
sequences are then said to be "substantially identical." This definition also refers to, or may 

30 be applied to, the compliment of a test sequence. The definition also includes sequences that 
have deletions and/or additions, as well as those that have substitutions. As described below, 
the preferred algorithms can account for gaps and the like. Preferably, identity exists over a 
region that is at least about 25 amino acids or nucleotides in length, or more preferably over a 
region that is 50-100 amino acids or nucleotides in length. 
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For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
5 program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
any one of the number of contiguous positions selected from the group consisting of from 20 

10 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a 
sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. 

15 Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, Mol. 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l. 
Acad. Set USA 85:2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 

20 visual inspection (see, e.g. t Current Protocols in Molecular Biology (Ausubel et al 9 eds. 
1995 supplement)). 

A preferred example of algorithm that is suitable for determining percent 
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which 
are described in Altschul et al, Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et aL, J. 

25 Mol. Biol. 215:403-410 (1990), respectively. BLAST and BLAST 2.0 are used, with the 

parameters described herein, to determine percent sequence identity for the nucleic acids and 
proteins of the invention. Software for performing BLAST analyses is publicly available 
through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). 
This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 

30 short words of length W in the query sequence, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score threshold (Altschul et al, supra). 
These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs 
containing them. The word hits are extended in both directions along each sequence for as 
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far as the cumulative alignment score can be increased. Cumulative scores are calculated 
using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the 
5 word hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 

10 uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M=5, N—4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & HenikofF, Proa Natl Acad. Sci. USA 89:10915 (1989)) alignments (B) of 
50, expectation (E) of 10, M=5, N— 4, and a comparison of both strands. 

15 The BLAST algorithm also performs a statistical analysis of the similarity 

between two sequences (see, e.g., Karlin & Altschul, Proc. Natl Acad. Sci. USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N))> which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 

20 nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001. 

An indication that two nucleic acid sequences or polypeptides are substantially 
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 

25 reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, for example, where the two peptides differ only by conservative substitutions. 
Another indication that two nucleic acid sequences are substantially identical is that the two 
molecules or their complements hybridize to each other under stringent conditions, as 

30 described below. Yet another indication that two nucleic acid sequences are substantially 
identical is that the same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
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prokaryotic cells such as E. coli, or eukaiyotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
5 herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers and 
non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 

10 as well as amino acid analogs and amino acid mimetics that function in a manner similar to 
the naturally occurring amino acids. Naturally occurring amino acids are those encoded by 
the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and Ophosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is 

15 bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified 
R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical 
structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical 
compounds that have a structure that is different from the general chemical structure of an 

20 amino acid, but that functions in a manner similar to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known three 
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

25 "Conservatively modified variants" applies to both amino acid and nucleic 

acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 

30 number of functionally identical nucleic acids encode any given protein. For instance, the 
codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every 
position where an alanine is specified by a codon, the codon can be altered to any of the 
corresponding codons described without altering the encoded polypeptide. Such nucleic acid 
variations are "silent variations," which are one species of conservatively modified 
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variations. Every nucleic acid sequence herein which encodes a polypeptide also describes 
every possible silent variation of the nucleic acid. One of skill will recognize that each codon 
in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, 
which is ordinarily the only codon for tryptophan) can be modified to yield a functionally 
5 identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a 
polypeptide is implicit in each described sequence with respect to the expression product, but 
not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 

10 sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 

15 polymorphic variants, interspecies homologs, and alleles of the invention. 

The following eight groups each contain amino acids that are conservative 
substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid 
(E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), 
Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan 

20 (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) {see, e.g., Creighton, 
Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be described in 
terms of various levels of organization. For a general discussion of this organization, see, 
e.g. 9 Alberts et al., Molecular Biology of the Cell (3 rd ed., 1994) and Cantor and Schimmel, 

25 Biophysical Chemistry Part I: Tlie Conformation of Biological Macromolecules (1 980). 
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 
structures are commonly known as domains. Domains are portions of a polypeptide that 
form a compact unit of the polypeptide and are typically 25 to approximately 500 amino 

30 acids long. Typical domains are made up of sections of lesser organization such as stretches 
of p-sheet and a-helices. "Tertiary structure" refers to the complete three dimensional 
structure of a polypeptide monomer. "Quaternary structure" refers to the three dimensional 
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structure formed, usually by the noncovalent association of independent tertiary units. 
Anisotropic terms are also known as energy terms. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
5 means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 
which can be made detectable, e.g., by incorporating a radiolabel into the peptide or used to 
detect antibodies specifically reactive with the peptide. 

An "effector" or "effector moiety" or "effector component" is a molecule that 

10 is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "effector" can be a variety of molecules including, for example, detection moieties 
including radioactive compounds, fluroescent compounds, an enzyme or substrate, tags such 
as epitope tags, a toxin; a chemotherapeutic agent; a lipase; an antibiotic; or a radioisotope 

1 5 emitting "hard" e.g. , beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 

20 using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 

25 through hydrogen bond formation. As used herein, a probe may include. natural (i.e., A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere 
with hybridization. Thus, for example, probes may be peptide nucleic acids in which the 
constituent bases are joined by peptide bonds rather than phosphodiester linkages. It will be 

30 understood by one of skill in the art that probes may bind target sequences lacking complete 
complementarity with the probe sequence depending upon the stringency of the hybridization 
conditions. The probes are preferably directly labeled as with isotopes, chromOphores, 
lumiphores, chromogens, or indirectly labeled such as with biotin to which a streptavidin 
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complex may later bind. By assaying for the presence or absence of the probe, one can detect 
the presence or absence of the select sequence or subsequence. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic 
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
5 modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, for 
example, recombinant cells express genes that are not found within the native (non- 
recombinant) form of the cell or express native genes that are otherwise abnormally 
expressed, under expressed or not expressed at all. 

10 The term "heterologous" when used with reference to portions of a nucleic 

acid indicates that the nucleic acid comprises two or more subsequences that are not found in 
the same relationship to each other in nature. For instance, the nucleic acid is typically 
recombinantly produced, having two or more sequences from unrelated genes arranged to 
make a new functional nucleic acid, e.g., a promoter from one source and a coding region 

1 5 from another source. Similarly, a heterologous protein indicates that the protein comprises 
two or more subsequences that are not found in the same relationship to each other in nature 
(e.g., a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 

20 acid sequences near the start site of transcription, such as, in the case of a polymerase II type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 

25 active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

30 An "expression vector" is a nucleic acid construct, generated recombinantly or 

synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 
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The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
5 The phrase "stringent hybridization conditions" refers to conditions under 

which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 

10 Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, stringent conditions are selected to be about 5-1 0°C lower than the 
theimal melting point (T m ) for the specific sequence at a defined ionic strength pH. The T m is 
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% 

15 of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at T m , 50% of the probes are occupied at 
equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other 
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 

20 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 
formamide. For selective or specific hybridization, a positive signal is at least two times 
background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 

25 incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency 
amplification, although annealing temperatures may vary between about 32°C and 48°C 
depending on primer length. For high stringency PCR amplification, a temperature of about 
62°C is typical, although high stringency annealing temperatures can range from about 50°C 

30 to about 65°C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for 
30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about 
72°C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification 
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reactions are provided, e.g., in Innis et al. (1990) PCR Protocols, A Guide to Methods and 
Applications, Academic Press, Inc. N.Y.). 

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical. 
5 This occurs, for example, when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 37°C, and a wash in IX SSC at 45°C. A positive hybridization is at least twice 

10 background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ausubel, et al 

The phrase "functional effects" in the context of assays for testing compounds 

15 that modulate activity of an angiogenesis protein includes the determination of a parameter 
that is indirectly or directly under the influence of the angiogenesis protein, e.g., a functional, 
physical, or chemical effect, such as the ability to increase or decrease angiogenesis. It 
includes binding activity, the ability of cells to proliferate, expression in cells undergoing 
angiogenesis, and other characteristics of angiogenic cells. "Functional effects" include in 

20 vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of an 
angiogenesis protein sequence, e.g., functional, physical and chemical effects. Such 
functional effects can be measured by any means known to those skilled in the art, e.g., 

25 changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), 
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the angiogenesis protein; 
measuring binding activity or binding assays, e.g. binding to antibodies, and measuring 
cellular proliferation, particularly endothelial cell proliferation, cell viability, cell division 

30 especially of endothelial cells, lumen formation and capillary or vessel growth or formation. 
Determination of the functional effect of a compound on angiogenesis can also be performed 
using angiogenesis assays known to those of skill in the art such as an in vitro assays, e.g., in 
vitro endothelial cell tube formation assays, and other assays such as the chick CAM assay, 
the mouse corneal assay, and assays that assess vascularization of an implanted tumor. The 
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functional effects can be evaluated by many means known to those skilled in the art, e.g., 
microscopy for quantitative or qualitative measures of alterations in morphological features, 
e.g., tube or blood vessel formation, measurement of changes in RNA or protein levels for 
angiogenesis-associated sequences, measurement of RNA stability, identification of 
5 downstream or reporter gene expression (CAT, luciferase, p-gal, GFP and the like), e.g., via 
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible 
markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of angiogenic polynucleotide and 
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 

1 0 identified using in vitro and in vivo assays of angiogenic polynucleotide and polypeptide 
sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block activity, 
decrease, prevent, delay activation, inactivate, desensitize, or down regulate the activity or 
expression of angiogenesis proteins, e.g, antagonists. "Activators" are compounds that 
increase, open, activate, facilitate, enhance activation, sensitize, agonize, or up regulate 

15 angiogenesis protein activity. Inhibitors, activators, or modulators also include genetically 
modified versions of angiogenesis proteins, e.g., versions with altered activity, as well as 
naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small chemical 
molecules and the like. Such assays for inhibitors and activators include, e.g., expressing the 
angiogenic protein in vitro, in cells, or cell membranes, applying putative modulator 

20 compounds, and then determining the functional effects on activity, as described above. 
Activators and inhibitors of angiogenesis can also be identified by incubating angiogenic 
cells with the test compound and determining increases or decreases in the expression of 1 or 
more angiogenesis proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more angiogenesis 
proteins, such as angiogenesis proteins comprising the sequences set out in Table 8. 

25 Samples or assays comprising angiogenesis proteins that are treated with a 

potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine the extent of inhibition. Control samples 
(untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition 
of a polypeptide is achieved when the activity value relative to the control is about 80%, 

30 preferably 50%, more preferably 25-0%. Activation of an angiogenesis polypeptide is 

achieved when the activity value relative to the control (untreated with activators) is 1 10%, 
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more preferably 1000-3000% higher. 

15 
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"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes* as well as the myriad immunoglobulin variable region 
5 genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody 
will be most critical in specificity and affinity of binding. 

An exemplary immunoglobulin (antibody) structural unit comprises a 

10 tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light' 3 ' (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 110 or more amino acids primarily 
responsible for antigen recognition. The terms variable light chain (Vl) and variable heavy 
chain (Vh) refer to these light and heavy chains respectively. 

15 Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 

characterized fragments produced by digestion with various peptidases. Thus, for example, 
pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)*2, 
a dimer of Fab which itself is a light chain joined to Vh-C h 1 by a disulfide bond. The F(ab)*2 
may be reduced under mild conditions to break the disulfide linkage in the hinge region, 

20 thereby converting the F(ab) , 2 dimer into an Fab' monomer. The Fab' monomer is 

essentially Fab with part of the hinge region {see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terms of the digestion of an intact 
antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the tenn antibody, as used 

25 herein, also includes antibody fragments either produced by the modification of whole 

antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries (see, e.g., McCafferty et ah, Nature 
348:552-554 (1990)) 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 

30 antibodies, many technique known in the art can be used (see, e.g., Kohler & Milstein, 

Nature 256:495-497 (1975); Kozbor et al, Immunology Today 4: 72 (1983); Cole et aL, pp. 
77-96 in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985); Coligan, 
Current Protocols in Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual 
(1988); and Goding, Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). 
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Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can be 
adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or 
other organisms such as other mammals, may be used to express humanized antibodies. 
Alternatively, phage display technology can be used to identify antibodies and heteromeric 
5 Fab fragments that specifically bind to selected antigens {see, e.g., McCafferty et al 9 Nature 
348:552-554 (1990); Marks et aL, Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector function 
1 0 and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, eg., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

The detailed description of the invention includes discussion of the following 
1 5 aspects of the invention: Expression of angiogenesis-associated sequences 

Informatics 

Angiogenesis-associated sequences 

Detection of angiogeriesis sequence for diagnostic and 
therapeutic applications 

20 Modulators of angiogenesis 

Methods of identifying variant angiogenesis-associated 
sequences 

Administration of pharmaceutical and vaccine compositions 
Kits for use in diagnostic and/or prognostic applications. 

25 Expression of angiogenesis-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 

30 a number of genes simultaneously allows the generation of a gene expression profile that is 
unique to the state of the cell. That is, normal tissue may be distinguished from AD tissue. 
By comparing expression profiles of tissue in known different angiogenesis states, 
information regarding which genes are important (including both up- and down-regulation of 
genes) in each of these states is obtained. The identification of sequences that are 
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differentiaUy expressed in angiogenic versus non-angiogenic tissue allows the use of this 
information in a number of ways. For example, a particular treatment regime may be 
evaluated: does a chemotherapeutic drug act to down-regulate angiogenesis, and thus tumor 
growth or recurrence, in a particular patient. Similarly, diagnosis and treatment outcomes 
5 may be done or confirmed by comparing patient samples with the known expression profiles. 
Angiogenic tissue can also be analyzed to determine the stage of angiogenesis in the tissue. 
Furthermore, these gene expression profiles (or individual genes) allow screening of drug 
candidates with an eye to mimicking or altering a particular expression profile; for example, 
screening can be done for drugs that suppress the angiogenic expression profile. This may be 

10 done by making biochips comprising sets of the important angiogenesis genes, which can 
then be used in these screens. These methods can also be done on the protein basis; that is, 
protein expression levels of the angiogenic proteins can be evaluated for diagnostic purposes 
or to screen candidate agents. In addition, the angiogenic nucleic acid sequences can be 
administered for gene therapy purposes, including the administration of antisense nucleic 

1 5 acids, or the angiogenic proteins (including antibodies and other modulators thereof) 
administered as therapeutic drugs. 

Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in angiogenesis, herein termed "angiogenesis sequences". As 
outlined below, angiogenesis sequences include those that are up-regulated (i.e. expressed at 

20 a higher level) in disorders associated with angiogenesis, as well as those that are down- 
regulated (i.e. expressed at a lower level). In a preferred embodiment, the angiogenesis 
sequences are from humans; however, as will be appreciated by those in the art, angiogenesis 
sequences from other organisms may be useful in animal models of disease and drug 
evaluation; thus, other angiogenesis sequences are provided, from vertebrates, including 

25 mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, farm animals 
(including sheep, goats, pigs, cows, horses, etc). Angiogenesis sequences from other 
organisms may be obtained using the techniques outlined below. 

Angiogenesis sequences can include both nucleic acid and amino acid 
sequences. In a preferred embodiment, the angiogenesis sequences are recombinant nucleic 

30 acids. By the term "recombinant nucleic acid" herein is meant nucleic acid, originally formed 
in vitro, in general, by the manipulation of nucleic acid e.g., using polymerases and 
endonucleases, in a form not normally found in nature. Thus an isolated nucleic acid, in a 
linear form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
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understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e. using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
5 recombinant for the purposes of the invention. 

Similarly, a "recombinant protein' 1 is a protein made using recombinant 
techniques, Le. through the expression of a recombinant nucleic acid as depicted above. A 
recombinant protein is distinguished from naturally occurring protein by at least one or more 
characteristics. For example, the protein may be isolated or purified away from some or all 

10 of the proteins and compounds with which it is normally associated in its wild type host, and 
thus may be substantially pure. For example, an isolated protein is unaccompanied by at least 
some of the material with which it is normally associated in its natural state, preferably 
constituting at least about 0.5%, more preferably at least about 5% by weight of the total 
protein in a given sample. A substantially pure protein comprises at least about 75% by 

15 weight of the total protein, with at least about 80% being preferred, and at least about 90% 
being particularly preferred. The definition includes the production of an angiogenesis protein 
from one organism in a different organism or host cell. Alternatively, the protein may be 
made at a significantly higher concentration than is normally seen, through the use of an 
inducible promoter or high expression promoter, such that the protein is made at increased 

20 concentration levels. Alternatively, the protein may be in a form not normally found in 
nature, as in the addition of an epitope tag or amino acid substitutions, insertions and 
deletions, as discussed below. 

In a preferred embodiment, the angiogenesis sequences are nucleic acids. As 
will be appreciated by those in the art and is more fully outlined below, angiogenesis 

25 sequences are useful in a variety of applications, including diagnostic applications, which will 
detect naturally occurring nucleic acids, as well as screening applications; for example, 
biochips comprising nucleic acid probes to the angiogenesis sequences can be generated. In 
the broadest sense, then, by "nucleic acid" or "oligonucleotide" or grammatical equivalents 
herein means at least two nucleotides covalently linked together. A nucleic acid of the 

30 present invention will generally contain phosphodiester bonds, although in some cases, 
nucleic acid analogs are included that may have alternate backbones, comprising, for 
example, phosphoramidate, phosphorothioate, phosphorodithioate, or O- 
methylphophoroamidite linkages, (see Eckstein, Oligonucleotides and Analogues: A Practical 
Approach, Oxford University Press); and peptide nucleic acid backbones and linkages. Other 
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analog nucleic acids include those with positive backbones; non-ionic backbones, and non- 
ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, 
and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate Modifications in 
Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing one or 
5 more carbocyclic sugars are also included within one definition of nucleic acids. 

Modifications of the ribose-phosphate backbone may be done for a variety of reasons, for 
example to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. 

As will be appreciated by those in the art, nucleic acid analogs may find use in 

10 the present invention. In addition, mixtures of naturally occurring nucleic acids and analogs 
can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of 
naturally occurring nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 

1 5 contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 

20 due to their non-ionic nature, hybridization of the bases attached to these backbones is 

relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 

25 by those in the art, the depiction of a single strand also defines the sequence of the 

complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 

30 xanthine hypoxanthine, isocytosine, isoguanine, etc. As used herein, the term "nucleoside" 
includes nucleotides and nucleoside and nucleotide analogs, and modified nucleosides such 
as amino modified nucleosides. In addition, "nucleoside" includes non-naturally occurring 
analog structures. Thus for example the individual units of a peptide nucleic acid, each 
containing a base, are referred to herein as a nucleoside. 
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An angiogenesis sequence can be initially identified by substantial nucleic 
acid and/or amino acid sequence homology to the angiogenesis sequences outlined herein. 
Such homology can be based upon the overall nucleic acid or amino acid sequence, and is 
generally determined as outlined below, using either homology programs or hybridization 
5 conditions. 

For identifying angiogenesis-associated sequences, the angiogenesis screen 
typically includes comparing genes identified in a modification of an in vitro model of 
angiogenesis as described in Hiraoka, Cell 95:365 (1998) with genes identified in controls. 
Samples of normal tissue and tissue undergoing angiogenesis are applied to biochips 

10 comprising nucleic acid probes. The samples are first microdissected, if applicable, and 
treated as is known in the art for the preparation of mRNA. Suitable biochips are 
commercially available, for example from Affymetrix. Gene expression profiles as described 
herein are generated and the data analyzed. 

In a preferred embodiment, the genes showing changes in expression as 

15 between normal and disease states are compared to genes expressed in other normal tissues, 
including, but not limited to lung, heart, brain, liver, breast, kidney, muscle, prostate, small 
intestine, large intestine, spleen, bone and placenta. In a preferred embodiment, those genes 
identified during the angiogenesis screen that are expressed in any significant amount in other 
tissues are removed from the profile, although in some embodiments, this is not necessary. 

20 That is, when screening for drugs, it is usually preferable that the target be disease specific, to 
minimize possible side effects. 

In a preferred embodiment, angiogenesis sequences are those that are up- 
regulated in angiogenesis disorders; that is, the expression of these genes is higher in the 
disease tissue as compared to normal tissue. <c Up-regulation" as used herein means at least 

25 about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher being preferred. All accession numbers herein are for the GenBank 
sequence database and the sequences of the accession numbers are hereby expressly 
incorporated by reference. GenBank is known in the art, see, e.g., Benson, DA, et al., 
Nucleic Acids Research 26:1-7 (1998) and http://www.ncbi.nlm.nih.gov/. Sequences are also 

30 avialable in other databases, e.g. , European Molecular Biology Laboratory (EMBL) and 
DNA Database of Japan (DDBJ). In addition, most preferred genes were found to be 
expressed in a limited amount or not at all in heart, brain, lung, liver, breast, kidney, prostate, 
small intestine and spleen. 
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In another preferred embodiment, angiogenesis sequences are those that are 
down-regulated in the angiogenesis disorder; that is, the expression of these genes is lower in 
angiogenic tissue as compared to normal tissue. "Down-regulation" as used herein means at 
least about a two-fold change, preferably at least about a three fold change, with at least about 
5 five-fold or higher being preferred 

Angiogenesis sequences according to the invention may be classified into 
discrete clusters of sequences based on common expression profiles of the sequences. 
Expression levels of angiogenesis sequences may increase or decrease as a function of time in 
a manner that correlates with the induction of angiogenesis. Alternatively, expression levels 

10 of angiogenesis sequences may both increase and decrease as a function of time. For 
example, expression levels of some angiogenesis sequences are temporarily induced or 
diminished during the switch to the angiogenesis phenotype, followed by a return to baseline 
expression levels. Tables 1-8 provides genes, the mRNA expression of which varies as a 
function of time in angiogenesis tissue when compared to normal tissue. 

15 In a particularly preferred embodiment, angiogenesis sequences are those that 

are induced for a period of time, typically by positive angiogenic factors, followed by a return 
to the baseline levels. Sequences that are temporarily induced provide a means to target 
angiogenesis tissue, for example neovascularized tumors, at a particular stage of 
angiogenesis, while avoiding rapidly growing tissue that require perpetual vascularization. 

20 Such positive angiogenic factors include aFGF, jiFGF, VEGF, angiogenin and the like. 

Induced angiogenesis sequences also are further categorized with respect to 
the timing of induction. For example, some angiogenesis genes may be induced at an early 
time period, such as within 10 minutes of the induction of angiogenesis. Others may be 
induced later, such as between 5 and 60 minutes, while yet others may be induced for a time 

25 period of about two hours or more followed by a return to baseline expression levels. 

In another preferred embodiment are angiogenesis sequences that are inhibited 
or reduced as a function of time followed by a return to "normal" expression levels. 
Inhibitors of angiogenesis are examples of molecules that have this expression profile. These 
sequences also can be further divided into groups depending on the timing of diminished 

30 expression. For example, some molecules may display reduced expression within 10 minutes 
of the induction of angiogenesis. Others may be diminished later, such as between 5 and 60 
minutes, while others may be diminished for a time period of about two hours or more 
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followed by a return to baseline. Examples of such negative angiogenic factors include 
thrombospondin and endostatin to name a few. 

In yet another preferred embodiment are angiogenesis sequences that are 
induced for prolonged periods. These sequences are typically associated with induction of 
5 angiogenesis and may participate in induction and/or maintenance of the angiogenesis 
phenotype. 

In another preferred embodiment are angiogenesis sequences, the expression 
of which is reduced or diminished for prolonged periods in angiogenic tissue. These 
sequences are typically angiogenesis inhibitors and their diminution is correlated with an 
1 0 increase in angiogenesis. 

Informatics 

The ability to identify genes that undergo changes in expression with time 
during angiogenesis can additionally provide high-resolution, high-sensitivity datasets which 

1 5 can be used in the areas of diagnostics, therapeutics, drug development, biosensor 

development, and other related areas. For example, the expression profiles can be used in 
diagnostic or prognostic evaluation of patients with angiogenesis-associated disease. Or as 
another example, subcellular toxicological information can be generated to better direct drug 
structure and activity correlation {see, Anderson, L., "Pharmaceutical Proteomics: Targets, 

20 Mechanism, and Function," paper presented at the IBC Proteomics conference, Coronado, 
CA (June 11-12, 1998)). Subcellular toxicological information can also be utilized in a 
biological sensor device to predict.the likely toxicological effect of chemical exposures and 
likely tolerable exposure thresholds (see, U.S. Patent No. 5,81 1,231). Similar advantages 
accrue from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 

25 saccharides, lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of data assay data. The data contained in the database is acquired , 
e.g., using array analysis either singly or in a library format. The database can be in 
substantially any form in which data can be maintained and transmitted, but is preferably an 

30 electronic database. The electronic database of the invention can be maintained on any 
electronic device allowing for the storage of and access to the database, such as a personal 
computer, but is preferably distributed on a wide area network, such as the World Wide Web. 
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The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative 
5 and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing angiogenesis, i.e., the identification of angiogenesis-associated 
sequences described herein, provide an abundance of information, which can be correlated 
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
gene-disease causal linkages, identification of correlates of immunity and physiological 

10 status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is 
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 

1 5 database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein function 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 
containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 

20 for obtaining full-length sequences from the collection of partial length sequences. U.S. 
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 

25 in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 

30 Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 
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The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 
5 In an exemplary embodiment, at least one of the sources of target-containing 

sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for angiogenesis. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 

10 target species in a sample: (1) a unique identification code, which can include, eg., a target 
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 
present in the sample. 

The invention also provides for the storage and retrieval of a collection of 

1 5 target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 

20 transistor gate states, such as an array of cells in a DRAM device {e.g., each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 

25 When the target is a peptide or nucleic acid, the invention preferably provides 

a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 

30 embodiment thereof (e.g. , FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM- 
compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format 
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i 

(e.g., Linux, SunOS, Solaris, ADC, SCO Unix, VMS, MV, Macintosh, etc.} floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 
5 The invention also provides a network, comprising a plurality of computing 

devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal tranmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
10 cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
1 5 database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 
20 initialized to load and execute the computer program for alignment and/or comparison of the 
assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to 
25 secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 
30 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
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an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
5 collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

10 Angiogenesis-associated sequences 

Angiogenesis proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment,the 
angiogenesis protein is an intracellular protein. Intracellular proteins may be found in the 
cytoplasm and/or in the nucleus or associated with the intracellular side of the plasma 

15 membrane. Intracellular proteins are involved in all aspects of cellular function and 

replication (including, signaling pathways); aberrant expression of such proteins often 
results in unregulated or disregulated cellular processes (see, e.g., Molecular Biology of the 
Cell, 3rd Edition, Alberts, Ed., Garland Pub., 1994). For example, many intracellular 
proteins have enzymatic activity such as protein kinase activity, protein phosphatase activity, 

20 protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular 
proteins also serve as docking proteins that are involved in organizing complexes of proteins, 
or targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 

25 in the proteins of one or more motifs for which defined functions have been attributed. In 

addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SHE) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 

30 domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 

targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
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sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 

In another embodiment, the angiogenesis sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. 
5 They may have an intracellular domain, an extracellular domain, or both. The intracellular 
domains of such proteins may have a number of functions including those already described 
for intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 

10 kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 

1 5 guanylyl cyclases and receptor serine/threonine protein kinases contain a single 

transmembrane domain. However, various other proteins including channels and adenylyl 
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 

20 transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 
may be followed or flanked by charged amino acids. Therefore, upon analysis of the amino 
acid sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted (see, e.g. PSORT web site http://psort.nibb.ac.jp/). 

The extracellular domains of transmembrane proteins are diverse; however, 

25 conserved motifs are found repeatedly among various extracellular domains. Conserved 
structure and/or functions have been ascribed to different extracellular motifs. Many 
extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like, 

30 For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions. Cell- 
associated ligands can be tethered to the cell for example via a glycosylphosphatidylinositol 
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(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains also 
associate with the extracellular matrix and contribute to the maintenance of the cell structure. 

Angiogenesis proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
5 described herein. In addition, as outlined below, transmembrane proteins can be also useful 
in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeablized to provide acess to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can 

10 be made soluble by removing transmembrane sequences, for example through recombinant 
methods. Furthermore, transmembrane proteins that have been made soluble can be made to 
be secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the angiogenesis proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins have a signal 

1 5 peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 

proteins are involved in numerous physiological events; by virtue of their circulating nature, 
they serve to transmit signals to various other cell types. The secreted protein may function in 
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting 

20 on cells at a distance). Thus secreted molecules find use in modulating or altering numerous 
aspects of physiology. Angiogenesis proteins that are secreted proteins are particularly 
preferred in the present invention as they serve as good targets for diagnostic markers, e.g., 
for blood or serum tests. 

An angiogenesis sequence is typically initially identified by substantial nucleic 

25 acid and/or amino acid sequence homology or linkage to the angiogenesis sequences outlined 
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence, 
and is generally determined as outlined below, using either homology programs or 
hybridization conditions. Typically, linked sequences on a mRNA are found on the same 
molecule. 

30 As detailed in the definitions, percent identity can be determined using an 

algorithm such as BLAST. A preferred method utilizes the BLASTN module of WU- 
BLAST-2 set to the default parameters, with overlap span and overlap fraction set to 1 and 
0.125, respectively. The alignment may include the introduction of gaps in the sequences to 
be aligned. In addition, for sequences which contain either more or fewer nucleotides than 
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those of the nucleic acids of the figures, it is understood that the percentage of homology will 
be determined based on the number of homologous nucleosides in relation to the total number 
of nucleosides. Thus, for example, homology of sequences shorter than those of the 
sequences identified herein and as discussed below, will be determined using the number of 
5 nucleosides in the shorter sequence. 

In one embodiment, the nucleic acid homology is determined through 
hybridization studies. Thus, e.g. 9 nucleic acids which hybridize under high stringency to a 
nucleic acid of Tables 1-8 , or its complement, or is also found on naturally occurring 
mRNAs is considered an angiogenesis sequence. In another embodiment, less stringent 

10 hybridization conditions are used; for example, moderate or low stringency conditions may 
be used, as are known in the art; see Ausubel, supra, and Tijssen, supra. 

In addition, the angiogenesis nucleic acid sequences of the invention, e.g, the 
sequence in Tables 1-8 , are fragments of larger genes, i.e. they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 

15 and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 
sequences provided herein, extended sequences, in either direction, of the angiogenesis genes 
can be obtained, using techniques well known in the art for cloning either longer sequences or 
the full length sequences; see Ausubel, et al, supra. Much can be done by informatics and 
many sequences can be clustered to include multiple sequences, e.g., systems such as 

20 UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/). 

Once the angiogenesis nucleic acid is identified, it can be cloned and, if 
necessary, its constituent parts recombined to form the entire angiogenesis nucleic acid 
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g., 
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid 

25 segment, the recombinant angiogenesis nucleic acid can be further-used as a probe to identify 
and isolate other angiogenesis nucleic acids, for example extended coding regions. It can 
also be used as a "precursor" nucleic acid to make modified or variant angiogenesis nucleic 
acids and proteins. 

The angiogenesis nucleic acids of the present invention are used in several 
30 ways. In a first embodiment, nucleic acid probes to the angiogenesis nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
or for administration, for example for gene therapy, vaccine, and/or antisense applications. 
Alternatively, the angiogenesis nucleic acids that include coding regions of angiogenesis 
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proteins can be put into expression vectors for the expression of angiogenesis proteins, again 
for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to angiogenesis nucleic acids 
(both the nucleic acid sequences outlined in the figures and/or the complements thereof) are 
5 made. The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the angiogenesis nucleic acids, Le. the target sequence (either the target 
sequence of the sample or to other probe sequences, for example in sandwich assays), such 
that hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be any number of base 

10 pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 

15 sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 

20 from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with 
25 either overlapping probes or probes to different sections of the target being used. That is, 
two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e. have some sequence in common), 
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
30 immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
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hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
equivalents herein is meant that the two moieties, the solid support and the probe, are 
5 attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 
10 In general, the probes are attached to the biochip in a wide variety of ways, as 

will be appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid 

15 support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method. As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 

20 copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 

polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluorescese. A preferred substrate is described in copending application entitled Reusable 

25 Low Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 
1999, herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 

30 sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, for 
example, the biochip is derivatized with a chemical functional group including, but not 
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limited to, amino groups, carboxy groups, oxo groups and thiol groups, with amino groups 
being particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, for example using linkers as are known in the 
5 art; for example, homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce 
Chemical Company catalog, technical section on cross-linkers, pages 155-200, incorporated 
herein by reference). In addition, in some cases, additional linkers, such as alkyl groups 
(including substituted and heteroalkyl groups) may be used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 

10 and then attached to the surface of the solid support. As will be appreciated by those skilled 
in the art, either the 5 f or 3' terminus may be attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 

1 5 bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
known in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 

20 in WO 95/25 1 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of angiogenesis-associated sequences. These assays are typically performed in 

25 conjunction with reverse transcription. In such assays, an angiogenesis-associated nucleic 
acid sequence acts as a template in an amplification reaction (e.g., Polymerase Chain 
Reaction, or PCR). In a quantitative amplification, the amount of amplification product will 
.be proportional to the amount of template in the original sample. Comparison to appropriate 
controls provides a measure of the amount of angiogenesis-associated RNA. Methods of 

30 quantitative amplification are well known to those of skill in the art. Detailed protocols for 

» 

quantitative PCR are provided, e.g., in Innis et al (1990) PCR Protocols, A Guide to Methods 
and Applications, Academic Press, Inc. N.Y.). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
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dye and a 3' quenching agent The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3* end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
5 quenching agent, thereby resulting in an increase in fluorescence as a function of 

amplification {see, for example, literature provided by Perkin-Elmer, e.g., www2.perkin- 
elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) (see, Wu and Wallace (1989) Genomics 4: 560, Landegren et ah (1988) 
10 Science 241 : 1077, and Barringer et al. (1990) Gene 89: 1 17), transcription amplification 

(Kwoh etal (1989) Proc. Natl. Acad. Set USA 86: 1173), self-sustained sequence replication 
(Guatelli et al (1990) Proc. Nat. Acad. Set USA 87: 1874), dot PCR, and linker adapter PCR, 
etc. 

In a preferred embodiment, angiogenesis nucleic acids, e.g., encoding 

15 angiogenesis proteins are used to make a variety of expression vectors to express 

angiogenesis proteins which can then be used in screening assays, as described below. 
Expression vectors and recombinant DNA technology are well known to those of skill in the 
art (see, e.g., Ausubel, supra, and Gene Expression Systems, Fernandez & Hoeffler, Eds, 
Academic Press, 1999) and are used to express proteins. The expression vectors may be 

20 either self-replicating extrachromosomal vectors or vectors which integrate into a host 
genome. Generally, these expression vectors include transcriptional and translational 
regulatory nucleic acid operably linked to the nucleic acid encoding the angiogenesis protein. 
The term "control sequences" refers to DNA sequences used for the expression of an 
operably linked coding sequence in a particular host organism. Control sequences that are 

25 suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, 
and a ribosome binding site. Eukaryotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 

30 secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
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and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 
do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
5 nucleic acid will generally be appropriate to the host cell used to express the angiogenesis 
protein; for example, transcriptional and translational regulatory nucleic acid sequences from 
Bacillus are preferably used to express the angiogenesis protein in Bacillus. Numerous types 
of appropriate expression vectors, and suitable regulatory sequences are known in the art for 
a variety of host cells. 

1 0 In general, transcriptional and translational regulatory sequences may include, 

but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 
start and stop sequences. 

15 Promoter sequences encode either constitutive or inducible promoters. The 

promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 

In addition, an expression vector may comprise additional elements. For 

20 example, the expression vector may have two replication systems, thus allowing it to be 

maintained in two organisms, for example in mammalian or insect cells for expression and in 
a procaryotic host for cloning and amplification. Furthermore, for integrating expression 
vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression construct. 

25 The integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art {e.g., Fernandez & Hoeffler, supra). See also Kitamura, et 
al. (1995) PNAS 92:9146-9150. 

In addition, in a preferred embodiment, the expression vector contains a 

30 selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The angiogenesis proteins of the present invention are produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding an 
angiogenesis protein, under the appropriate conditions to induce or cause expression of the 
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angiogenesis protein. Conditions appropriate for angiogenesis protein expression will vary 
with the choice of the expression vector and the host cell, and will be easily ascertained by 
one skilled in the art through routine experimentation or optimization. For example, the use 
of constitutive promoters in the expression vector will require optimizing the growth and 
5 proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 

10 and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the angiogenesis proteins are expressed in 

15 mammalian cells. Mammalian expression systems are also known in the art, and include 
retroviral and adenoviral systems. Of particular use as mammalian promoters are the 
promoters from mammalian viral genes, since the viral genes are often highly expressed and 
have a broad host range. Examples include the S V40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 

20 CMV promoter (see, e.g., Fernandez & Hoeffler, supra). Typically, transcription termination 
and polyadenylation sequences recognized by mammalian cells are regulatory regions located 
y to the translation stop codon and thus, together with the promoter elements, flank the 
coding sequence. Examples of transcription terminator and polyadenlytion signals include 
those derived form S V40. 

25 The methods of introducing exogenous nucleic acid into mammalian hosts, as 

well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 
polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 
encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 

30 into nuclei. 

La a preferred embodiment, angiogenesis proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 
bacteriophage may also be used and are known in the art. In addition, synthetic promoters 
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and hybrid promoters are also useful; for example, the tac promoter is a hybrid of the trp and 
lac promoter sequences. Furthermore, a bacterial promoter can include naturally occurring 
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome 
5 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the angiogenesis protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may also include a selectable marker gene to allow for the selection of 

10 bacterial strains that have been transformed. Suitable selection genes include genes which 
render the bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, 
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes, 
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These 
components are assembled into expression vectors. Expression vectors for bacteria are well 

1 5 known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, 
and Streptococcus lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial 
expression vectors are transformed into bacterial host cells using techniques well known in 
the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, angiogenesis proteins are produced in insect cells. 

20 Expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors, are well known in the art. 

In a preferred embodiment, angiogenesis protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, 

25 Kluyveromyces fragilis and K. lactis, Pichia guillerimondii and P. pastoris, 
Schizosaccharomyces pombe, and Yarroma lipolytica. 

The angiogenesis protein may also be made as a fusion protein, using 
techniques well known in the art. Thus, for example, for the creation of monoclonal 
antibodies, if the desired epitope is small, the angiogenesis protein may be fused to a carrier 

30 protein to form an immunogen. Alternatively, the angiogenesis protein may be made as a 
fusion protein to increase expression, or for other reasons. For example, when the 
angiogenesis protein is an angiogenesis peptide, the nucleic acid encoding the peptide may be 
linked to another nucleic acid for expression purposes. Fusion with detection epitope tags 
can be made, e.g., with FLAG, His 6, myc, HA, etc. 
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In one embodiment, the angiogenesis nucleic acids, proteins and antibodies of 
the invention are labeled. By "labeled" herein is meant that a compound has at least one 
element, isotope or chemical compound attached to enable the detection of the compound. In 
general, labels fall into three classes: a) isotopic labels, which may be radioactive or heavy 
5 isotopes; b) immune labels, which may be antibodies, antigens, or epitope tags and c) colored 
or fluorescent dyes. The labels may be incorporated into the angiogenesis nucleic acids, 
proteins and antibodies at any position. For example, the label should be capable of 
producing, either directly or indirectly, a detectable signal. The detectable moiety may be a 
radioisotope, such as 3 H, I4 C, 32 P, 35 S, or 125 I, a fluorescent or chemiluminescent compound, 

10 such as fluorescein isothiocyanate, rhodamine, or luciferin, or an en2yme, such as alkaline 
phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the art for 
conjugating the antibody to the label may be employed, including those methods described by 
Hunter et al, Nature, 144:945 (1962); David et al., Biochemistry, 13:1014 (1974); Pain et 
al., J. Immunol. Meth., 40:219 (1981); andNygren, J. Histochem. and Cytochem., 30:407 

15 (1982). 

Accordingly, the present invention also provides angiogenesis protein 
sequences. An angiogenesis protein of the present invention may be identified in several 
ways. "Protein" in this sense includes proteins, polypeptides, and peptides. As will be 
appreciated by those in the art, the nucleic acid sequences of the invention can be used to 

20 generate protein sequences. There are a variety of ways to do this, including cloning the 
entire gene and verifying its frame and amino acid sequence, or by comparing it to known 
sequences to search for homology to provide a frame, assuming the angiogenesis protein has 
an identifiable motif or homology to some protein in the database being used. Generally, the 
nucleic acid sequences are input into a program that will search all three frames for 

25 homology. This is done in a preferred embodiment using the following NCBI Advanced 

BLAST parameters. The program is blastx or blastn. The database is nr. The input data is as 
"Sequence in FASTA format". The organism list is "none". The "expect" is 10; the filter is 
default. The "descriptions" is 500, the "alignments" is 500, and the "alignment view" is 
pairwise. The "Query Genetic Codes" is standard (1). The matrix is BLOSUM62; gap 

30 existence cost is 1 1 , per residue gap cost is 1 ; and the lambda ratio is .85 default. This 
results in the generation of a putative protein sequence. 

Also included within one embodiment of angiogenesis proteins are amino acid 
variants of the naturally occurring sequences, as determined herein. Preferably, the variants 
are preferably greater than about 75% homologous to the wild-type sequence, more 
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preferably greater than about 80%, even more preferably greater than about 85% and most 
preferably greater than 90%. In some embodiments the homology will be as high as about 93 
to 95 or 98%. As for nucleic acids, homology in this context means sequence similarity or 
identity, with identity being preferred. This homology will be determined using standard 
5 techniques well known in the art as are outlined above for the nucleic acid homologies. 

Angiogenesis proteins of the present invention may be shorter or longer than 
the wild type amino acid sequences. Thus, in a preferred embodiment, included within the 
definition of angiogenesis proteins are portions or fragments of the wild type sequences, 
herein. In addition, as outlined above, the angiogenesis nucleic acids of the invention may be 

10 used to obtain additional coding regions, and thus additional protein sequence, using 
techniques known in the art. 

In a preferred embodiment, the angiogenesis proteins are derivative or variant 
angiogenesis proteins as compared to the wild-type sequence. That is, as outlined more fully 
below, the derivative angiogenesis peptide will often contain at least one amino acid 

15 substitution, deletion or insertion, with amino acid substitutions being particularly preferred. 
The amino acid substitution, insertion or deletion may occur at any residue within the 
angiogenesis peptide. 

Also included within one embodiment of angiogenesis proteins of the present 
invention are amino acid sequence variants. These variants typically fall into one or more of 

20 three classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the angiogenesis 
protein, using cassette or PCR mutagenesis or other techniques well known in the art, to 
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant angiogenesis protein fragments having up to 

25 about 1 00-1 50 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 
a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
angiogenesis protein amino acid sequence. The variants typically exhibit the same qualitative 
biological activity as the naturally occurring analogue, although variants can also be selected 

30 which have modified characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is 
predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed angiogenesis variants screened for 
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the optimal combination of desired activity. Techniques for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known, for example, Ml 3 
primer mutagenesis and PCR mutagenesis. Screening of the mutants is done using assays of 
angiogenesis protein activities. 
5 Amino acid substitutions are typically of single residues; insertions usually 

will be on the order of from about 1 to 20 amino acids, although considerably larger 
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in 
some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 

10 arrive at a final derivative. Generally these changes are done on a few amino acids to 

minimize the alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the angiogenesis protein are 
desired, substitutions are generally made in accordance with the amino acid substitution chart 
provided in the definition section. 

1 5 Substantial changes in function or immunological identity are made by 

selecting substitutions that are less conservative than those provided in the definition of 
"conservative substitution". For example, substitutions may be made which more 
significantly affect: the structure of the polypeptide backbone in the area of the alteration, for 
example the alpha-helical or beta-sheet structure; the charge or hydrophobicity of the 

20 molecule at the target site; or the bulk of the side chain. The substitutions which in general 
are expected to produce the greatest changes in the polypeptide's properties are those in 
which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic 
residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is 
substituted for (or by) any other residue; (c) a residue having an electropositive side chain, 

25 e.g. lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g. 
glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is 
substituted for (or by) one not having a side chain, e.g. glycine. 

The variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 

30 selected to modify the characteristics of the angiogenesis proteins as needed. Alternatively, 
the variant may be designed such that the biological activity of the angiogenesis protein is 
altered. For example, glycosylation sites may be altered or removed. 

Covalent modifications of angiogenesis polypeptides are included within the 
scope of this invention. One type of covalent modification includes reacting targeted amino 
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acid residues of an angiogenesis polypeptide with an organic derivatizing agent that is 
capable of reacting with selected side chains or the N-or C-terminal residues of an 
angiogenesis. polypeptide. Derivatization with bifunctional agents is useful, for instance, for 
crosslinking angiogenesis polypeptides to a water-insoluble support matrix or surface for use 
5 in the method for purifying anti-angiogenesis polypeptide antibodies or screening assays, as 
is more fully described below. Commonly used crosslinking agents include, 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, 
esters with 4-azidosalicylic acid, homobifiinctional imidoesters, including disuccinimidyl 
esters such as 3,3 l -dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N- 

1 0 maleimido- 1 ,8-octane and agents such as me1hyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine 9 phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, arginine, and histidine side chains [T.E. 

1 5 Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San 

Francisco, pp. 79-86 (1983)], acetylation of the N-tenninal amine, and amidation of any C- 
terminal carboxyl group. 

Another type of covalent modification of the angiogenesis polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern 

20 of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein 
to mean deleting one or more carbohydrate moieties found in native sequence angiogenesis 
polypeptide, and/or adding one or more glycosylation sites that are not present in the. native 
sequence angiogenesis polypeptide. Glycosylation patterns can be altered in many ways. For 
example the use of different cell types to express angiogenesis-associated sequences can 

25 result in different glycosylation patterns. 

Addition of glycosylation sites to angiogenesis polypeptides may also be 
accomplished by altering the amino acid sequence thereof. The alteration may be made, for 
example, by the addition of, or substitution by, one or more serine or threonine residues to the 
native sequence angiogenesis polypeptide (for O-linked glycosylation sites). The 

30 angiogenesis amino acid sequence may optionally be altered through changes at the DNA 

level, particularly by mutating the DNA encoding the angiogenesis polypeptide at preselected 
bases such that codons are generated that will translate into the desired amino acids. 
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Another means of increasing the number of carbohydrate moieties on the 
angiogenesis polypeptide is by chemical or enzymatic coupling of glycosides to the 
polypeptide. Such methods are described in the art, e.g., in WO 87/05330 published 1 1 
September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981). 
5 Removal of carbohydrate moieties present on the angiogenesis polypeptide 

may be accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al., Anal. Biochem., 118:131 

10 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al., Meth. 
EnzymoL, 138:350 (1987). 

Another type of covalent modification of angiogenesis comprises linking the 
angiogenesis polypeptide to one of a variety of nonproteinaceous polymers, e.g., 

1 5 polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Angiogenesis polypeptides of the present invention may also be modified in a 
way to form chimeric molecules comprising an angiogenesis polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 

20 molecule comprises a fusion of an angiogenesis polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the angiogenesis polypeptide. The 
presence of such epitope-tagged forms of an angiogenesis polypeptide can be detected using 
an antibody against the tag polypeptide. Also, provision of the epitope tag enables the 

25 angiogenesis polypeptide to be readily purified by affinity purification using an anti-tag 
antibody or another type of affinity matrix that binds to the epitope tag. In an alternative 
embodiment, the chimeric molecule may comprise a fusion of an angiogenesis polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of 
the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

30 Various tag polypeptides and their respective antibodies are well known in the 

art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 [Field et 
al, Mol Cell Biol, 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto [Evan et al, Molecular and Cellular Biology, 5:3610-3616 (1985)]; 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al, 
Protein Engineering, 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide 
[Hopp et al, BioTechnology, 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al, 
Science, 255:192-194 (1992)]; tubulin epitope peptide [Skinner et al, J. Biol Chem., 
5 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz-Freyennuth et al, 
Proc. Natl Acad. Set USA, 87:6393-6397 (1990)]. 

Also included with an embodiment of angiogenesis protein are other 
angiogenesis proteins of the angiogenesis family, and angiogenesis proteins from other 
organisms, which are cloned and expressed as outlined below. Thus, probe or degenerate 

10 polymerase chain reaction (PCR) primer sequences may be used to find other related 

angiogenesis proteins from humans or other organisms. As will be appreciated by those in 
the art, particularly useful probe and/or PCR primer sequences include the unique areas of the 
angiogenesis nucleic acid sequence. As is generally known in the art, preferred PCR primers 
are from about 15 to about 35 nucleotides in length, with from about 20 to about 30 being 

15 preferred, and may contain inosine as needed. The conditions for the PCR reaction are well 
known in the art (e.g., Innis, PCR Protocols, supra). 

In addition, as is outlined herein, angiogenesis proteins can be made that are 
longer than those encoded by the nucleic acids of the figures, e.g., by the elucidation of 
extended sequences, the addition of epitope or purification tags, the addition of other fusion 

20 sequences, etc. 

Angiogenesis proteins may also be identified as being encoded by 
angiogenesis nucleic acids. Thus, angiogenesis proteins are encoded by nucleic acids that 
will hybridize to the sequences of the sequence listings, or their complements, as outlined 
herein. 

25 In a preferred embodiment, when the angiogenesis protein is to be used to 

generate antibodies, e.g., for immunotherapy or immunodiagnosis, the angiogenesis protein 
should share at least one epitope or determinant with the ftdl length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 

30 made to a smaller angiogenesis protein will be able to bind to the full-length protein, 
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. In a preferred 
embodiment, the epitope is selected from a protein sequence set out in Table 8. 
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Methods of preparing polyclonal antibodies are known to the skilled artisan 
(e.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant. 
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple 
5 subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It 
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in 
the mammal being immunized. Examples of such immunogenic proteins include but are not 
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 

10 trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). The immunization protocol may be selected by one skilled in the art 
without undue experimentation. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 

1 5 antibodies may be prepared using hybridoma methods, such as those described by Kohler and 
Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 

20 immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
8 , or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 

25 glycol, to form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, 
Academic Press, (1986) pp. 59-103]. Immortalized cell lines are usually transformed 
mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, 
rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a 
suitable culture medium that preferably contains one or more substances that inhibit the 

30 growth or survival of the unfused, immortalized cells. For example, if the parental cells lack 
the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture 
medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine 
("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 
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In one embodiment, the antibodies are bispecific antibodies. Bispecific 
antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
5 protein encoded by a nucleic acid Tables 1-8 or a fragment thereof, the other one is for any 
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to angiogenesis protein are capable 

10 of reducing or eliminating a biological function of an angiogenesis protein, as is described 
below. That is, the addition of anti-angiogenesis protein antibodies (either polyclonal or 
preferably monoclonal) to angiogenic tissue (or cells containing angiogenesis) may reduce or 
eliminate the angiogenesis activity. Generally, at least a 25% decrease in activity is 
preferred, with at least about 50% being particularly preferred and about a 95-100% decrease 

1 5 being especially preferred. 

In a preferred embodiment the antibodies to the angiogenesis proteins are 
humanized antibodies (e.g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., Protein 
Design Labs,Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 

20 Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 

minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which residues form a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 

25 affinity and capacity. In some instances, Fv framework residues of the human 

immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
. may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 

30 substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
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immunoglobulin [Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 
332:323-329 (1988); andPresta, Curr. Op. Struct. Biol., 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. 
Generally, a humanized antibody has one or more amino acid residues introduced into it from 
5 a source which is non-human. These non-human amino acid residues are often referred to as 
import residues, which are typically taken from an import variable domain. Humanization 
can be essentially performed following the method of Winter and co-workers [Jones et al., 
Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et 
al, Science, 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the 

10 corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
human variable domain has been substituted by the corresponding sequence from a non- 
human species. In practice, humanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from 

1 5 analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 
(1991); Marks et al., J. Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and 
Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole et 

20 al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and Boerner et 
al., J. Immunol, 147(l):86-95 (1991)]. Similarly, human antibodies can be made by 
introducing of human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 

25 humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, arid in the following scientific publications: 
• Marks et al., Bio/Technology 10, 779-783 (1992); Lonberg et al., Nature 368 856-859 (1994); 
Morrison, Nature 368, 812-13 (1994); Fishwild et al., Nature Biotechnology 14, 845-51 

30 (1996); Neuberger, Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intern. Rev. 
Immunol. 13 65-93 (1995). 

By immunotherapy is meant treatment of angiogenesis with an antibody raised 
against angiogenesis proteins. As used herein, immunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 
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(patient). Active immunization is the induction of antibody and/or T-cell responses in a 
recipient (patient). Induction of an immune response is the result of providing the recipient 
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the 
art, the antigen may be provided by injecting a polypeptide against which antibodies are 
5 desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 
expressing the antigen and under conditions for expression of the antigen, leading to an 
immune response. 

In a preferred embodiment the angiogenesis proteins against which antibodies 
are raised are secreted proteins as described above. Without being bound by theory, 

10 antibodies used for treatment, bind and prevent the secreted protein from binding to its 
receptor, thereby inactivating the secreted angiogenesis protein. 

..; .„ In another preferred embodiment, the angiogenesis protein to which antibodies 
are raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment, bind the extracellular domain of the angiogenesis protein and prevent it from 

15 binding to other proteins, such as circulating ligands or cell-associated molecules. The 

antibody may cause down-regulation of the transmembrane angiogenesis protein. As will be 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
angiogenesis protein. The antibody is also an antagonist of the angiogenesis protein. 

20 Further, the antibody prevents activation of the transmembrane angiogenesis protein. In one 
aspect, when the antibody prevents the binding of other molecules to the angiogenesis 

( protein, the antibody prevents growth of the cell. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-p, IL-1, USTF-y 
. and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 

25 methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, angiogenesis is 
treated by administering to a patient antibodies directed against the transmembrane 
angiogenesis protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 

30 otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated or fused to an 
effector moiety. The effector moiety can be any number of molecules, including labelling 
moieties such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In 
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one aspect the therapeutic moiety is a small molecule that modulates the activity of the 
angiogenesis protein. In another aspect the therapeutic moiety modulates the activity of 
molecules associated with or in close proximity to the angiogenesis protein. The therapeutic 
moiety may inhibit enzymatic activity such as protease or collagenase activity associated with 
5. . angiogenesis, or bean attractant of other cells, such as NK cells. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to angiogenesis tissue or cells, results in a 
reduction in the number of afflicted cells, thereby reducing symptoms associated with 
angiogenesis. Cytotoxic agents are numerous and varied and include, but are not limited to, 

10 cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 

corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotiri, phenomycin, enomycin and the like. Cytotoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against angiogenesis 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 

15 the antibody. Targeting the thierapeutic moiety to transmembrane angiogenesis proteins not 
only serves to increase the local concentration of therapeutic moiety in the angiogenesis 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
the therapeutic moiety. 

In another preferred embodiment, the angiogenesis protein against which the 

20 antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
or fused to a protein which facilitates entry into the cell. In one case, the antibody enters the , 
.. cell by endocytosis. In another embodiment, a nucleic acid encoding the antibody is 
administered to the individual or cell. Moreover, wherein the angiogenesis protein can be 
targeted within a cell, i.e., the nucleus, an antibody thereto contains a signal for that target 

25 localization, i.e., a nuclear localization signal. 

The angiogenesis antibodies of the invention specifically bind to angiogenesis 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
K<j of at least about 0. 1 mM, more usually at least about 1 pM, preferably at least about 0. 1 
jiM or better, and most preferably, 0.01 or better. Selectivity of binding is also 

30 important. 

In a preferred embodiment, the angiogenesis protein is purified or isolated 
after expression. Angiogenesis proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present in the 
sample. Standard purification methods include electrophoretic, molecular, immunological 
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and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusing. For example, the angiogenesis protein 
may be purified using a standard anti-angiogenesis protein antibody column. Ultrafiltration 
and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
5 general guidance in suitable purification techniques, see Scopes, R., Protein Purification, 
Springer-Verlag, NY (1982). The degree of purification necessary will vary depending on 
the use of the angiogenesis protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the angiogenesis proteins and 
nucleic acids are useful in a number of applications. They may be used as immunoselection 
10 reagents, as vaccine reagents, as screening agents, etc. 

Detection of angiogenesis sequence for diagnostic and therapeutic applications 

In one aspect, the RNAexpression levels of genes are determined for different 
cellular states in the angiogenesis phenotype. Expression levels of genes in normal tissue 

15 ;(Le. 9 not undergoing angiogenesis) and in angiogenesis tissue (and in some cases, for varying 
severities of angiogenesis that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 
essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 

20 generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
: ' important (including both up- and down-regulation of genes) in each of these states is 
obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or angiogenesic tissue. This will provide 

25 for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including im activation or inactivation, in, e.g., 

30 normal versus angiogenic tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more statese. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
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expression is increased or decreased; i.e. y gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
5 .GeneChip™ expression arrays, Lockhart, Nature Biotechnology, 14:1675-1680 (1996), 

hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, Northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 

10 preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 
final gene product itself (protein) can be monitored, e.g., with antibodies to the angiogenesis 

15 protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to 
angiogenesis genes, i.e., those identified as being important in an angiogenesis phenotype, 
can be evaluated in an angiogenesis diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed 

20 simultaneously on a number of genes. Multiple protein expression monitoring can be 

performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
•. particular cell. The assays are further described below in the example. PCR techniques can 

25 be used to provide greater sensitivity. 

; . r ; v \ In a preferred embodiment nucleic acids encoding the angiogenesis protein are 
detected. Although DNA or RNA encoding the angiogenesis protein may be detected, of 
particular interest are methods wherein an mRNA encoding an angiogenesis protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 

30. complementary to and hybridizes with the mRNA and includes, but is not limited to, 

oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
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detected In another method detection of the mRNA is performed in situ, hi this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 
washing to remove the non-specifically bound probe, the label is detected. For example a 
5 digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding 
an angiogenesis protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins 
10 as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The angiogenesis proteins, antibodies, nucleic acids, modified proteins and cells 
containing angiogenesis sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
15 techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein angiogenesis proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of angiogenesis. 
Detection of these proteins in putative angiogenesis tissue allows for detection or diagnosis of 

20 angiogenesis. In one embodiment, antibodies are used to detect angiogenesis proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the angiogenesis protein is 
detected, e.g. , by immunob lotting with antibodies raised against the angiogenesis protein. 

25 Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the angiogenesis protein find use in 
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
Ce//5/o/ogy, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 
to many antibodies to the angiogenesis protein(s). Following washing to remove non-specific 

30 antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 
the antibody is detected by incubating with a secondary antibody that contains a detectable 
label. In another method the primary antibody to the angiogenesis protein(s) contains a 
detectable label, for example an enzyme marker that can act on a substrate. In another 
preferred embodiment each one of multiple primary antibodies contains a distinct and 
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detectable label. This method finds particular use in simultaneous screening for a plurality of 
angiogenesis proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are alsoprovided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
5 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing 
angiogenesis from biological samples, such as blood, urine, sputum, or other bodily fluids. 
As previously described, certain angiogenesis proteins are secreted/circulating molecules. 

1 0 Blood samples, therefore, are useful as samples to be probed or tested for the presence of 
secreted angiogenesis proteins. Antibodies can be used to detect an angiogenesis protein by 
previously described immunoassay techniques including ELISA, immunoblotting (W estern 
blotting), immunoprecipitation, BIACORE technology and the like. Conversely, the presence 
of antibodies may indicate an immune response against an endogenous angiogenesis protein. 

1 5 In a preferred embodiment, in situ hybridization of labeled angiogenesis 

nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
angiogenesis tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
« standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 

20 findings. It is further understood that the genes which indicate the diagnosis. may differ from 
those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 

. outcomes. , . •• ; 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic 

25 acids, modified proteins and cells containing angiogenesis sequences are used in prognosis 
assays. As above, gene expression profiles can be generated that correlate to angiogenesis 
severity, in terms of long term prognosis. Again, this may be done on either a protein or gene 
level, with the use of genes being preferred. As above, angiogenesis probes may be attached 
to biochips for the detection and quantification of angiogenesis sequences in a tissue or 

30 patient. The assays proceed as outlined above for diagnosis. PCR method may provide more 
sensitive and accurate quantification. 

In a preferred embodiment members of the three classes of proteins as 
described herein are used in drug screening assays. The angiogenesis proteins, antibodies, 
nucleic acids, modified proteins and cells containing angiogenesis sequences are used in drug 
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screening assays or by evaluating the effect of drug candidates on a "gene expression profile" 
or expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
5 ZIokamik, et al., Science 279, 84-8 (1 998); Heid, Genome Res 6:986-94, 1996). 
. .. In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic 

acids, modified proteins and cells containing the native or modified angiogenesis proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the angiogenesis phenotype or an identified physiological 

10 function of an angiogenesis protein. As above, this can be done on an individual gene level 
or by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 

15 . Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in angiogenesis, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
angiogenic protein. 'Modulation" thus includes both an increase and a decrease in gene 

20 expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing angiogenesis, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in angiogenic tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 1 0-fold decrease in 

25 angiogenic tissue compared to normal tissue often provides a target value of a 10-fold 
increase in expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the angiogenesis protein and standard 

30 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entitites, Le. 9 an expression profile, is monitored simultaneously. Such profiles will 
typically invove a plurality of those entitites described herein. . 
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In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, may 
be used with dispensed primers in desired wells. A PCR reaction can then be performed and 
5 analyzed for each well. 

Modulators of angiogenesis 

Expression monitoring can be performed to identify compounds that modify 
the expression of one or more angiogenesis-associated sequences, e.g., a polynucleotide 

10 sequence set out in Tables 1 -8 . Generally, in a preferred embodiment, a test modulator is 
added to the cells prior to analysis. Moreover, screens are also provided to identify agents 
that modulate angiogenesis, modulate angiogenesis proteins, bind to an angiogenesis protein, 
or interfere with the binding of an angiogenesis protein and an antibody or other binding 
partner. .. 

15 The term "test compound" or "drug candidate" or "modulator" or grammatical 

equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the angiogenesis phenotype or the expression of an angiogenesis sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 

20 expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses an angiogenesis phenotype, for example to a normal 
tissue fingerprint. In another embodiment, a modulator induced ah angiogenesis phenotype. 
Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 

25 concentrations serves as a negative control, i.e., at zero concentration or below the level of 
. detection, . - 

In one aspect, a modulator will neutralize the effect of an angiogenesis protein. 
By "neutralize" is meant that activity of a protein is inhibited or blocked and thereby has 
substantially no effect on a cell. 

30 In certain embodiments, combinatorial libraries of potential modulators will be 

screened for an ability to bind to an angiogenesis polypeptide or to modulate activity. 
Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
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and activity of those variant compounds: . Often, high throughput screening (HTS) methods 
are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve . 
providing a library containing a large number of potential therapeutic compounds (candidate 
5 compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 
A combinatorial chemical library is a collection of diverse chemical 

10 compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library, such as a polypeptide {e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length {i.e., the number of amino acids in a polypeptide compound). Millions of chemical 

1 5 compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et al (1994) J. Med. Chem. 37(9): 1233-1251). 

Preparation and screening of combinatorial chemical libraries is well known to 
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries {see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Int. J. Pept. Prot. Res. 9 

20 37: 487-493, Houghton et al. {\99\).Nature, 354: 84-88), peptoids (PCT Publication No WO 
. 91/19735, 26 Dec. 1991), endoded peptides (PCT Publication WO 93/20242, 14 Oct. 1993), 
random bio-oligomers (PCT Publication WO 92/00091, 9 Jan. 1992), benzodiazepines (U.S. 
Pat. No. 5,288,5 1 4), diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs 
et al, (1993) Prod. Nat. Acad. Sci. USA 90: 6909-6913), vinylogous polypeptides (Hagihara 

25 et al. (1992) J: Amer. Chem. Soc. 1 14: 6568), nonpeptidal peptidomimetics with a Beta-D- 
Giucose scaffolding (Hirschmann et dl 9 (1992) J. Amer. Chem. Soc. 114: 9217-9218), 
analogous organic syntheses of small compound libraries (Chen et al. (1994) J, Amer. Chem. 
Soc. 116: 2661), oligocarbamates (Cho, et al., (1993) Science 261:1303), and/or peptidyl 
phosphonates (Campbell et al, (1994) J. Org. Chem. 59: 658). See, generally, Gordon et al, 

30 (1994) J. Med. Chem. 37:1385, nucleic acid libraries {see, e.g., Strategene, Corp.), peptide 
nucleic acid libraries {see, e.g., U.S. Patent 5,539,083), antibody libraries {see, e.g., Vaughn 
et al (1996) Nature Biotechnology, 14(3): 309-314), and PCT/US96/1 0287), carbohydrate 
libraries {see, e.g., Liang et al, (1996) Science, 274: 1520-1522, and U.S. Patent No. 
5,593,853), and small organic molecule libraries {see, e.g., benzodiazepines, Baum (1993) 
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C&EN, Jan 18, page 33; isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and 
metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 
5,519,134; morpholino compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent 
No. 5,288,514; and the like). 
5 Devices for the preparation of combinatorial libraries are commercially 

available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
1 0 solution phase chemistries. These systems include automated workstations like the 

automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkintdn, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif), which mimic the manual 
synthetic operations performed by a chemist. Any of the above devices are suitable for use 
1 5 with the present invention. The nature and implementation of modifications to these devices 
(if any) so. that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g., ComGenex, Princeton, N. J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
20 Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of angiogenesis gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

25 : High throughput assays for the presence, absence, quantification, or other 

properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
for example, U.S. Patent No. 5,559,410 discloses high throughput screening methods for 
. proteins, U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic 

30 acid binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
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typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detectors) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a high degree of flexibility and customization. The manufacturers of such systems provide 
5 detailed protocols for various high throughput systems. Thus, for example, Zymark Corp. 
provides technical bulletins describing screening systems for detecting the modulation of 
gene transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 

10 proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
preferred. Paticularly useful test compound will be directed to the class of proteins to which 

15 the target belongs, substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or ''biased" random peptides. By 

20 "randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 
may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 

25 most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 
preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 

30 limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are .randomized within a defined class, for example, of hydrophobic 
amino acids, hydrophilic residues, sterically biased (either small or large) residues, towards 
the creation of nucleic acid binding domains, the creation of cysteines, for cross-linking, 
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prolines for SH-3 domains, serines, threonines, tyrosines or histidines for phosphorylation 
. sites, etc., or to purines, etc. 

Modulators of angiogenesis can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
5 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaiyotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 

10 After the candidate agent has been added and the cells allowed to incubate for 

some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PCR performed as appropriate. For example, 

1 5 an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, for example, a 
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of 
detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 

20 such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 

appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an, enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 

25 streptavidin is labeled as described above, thereby, providing a detectable signal for the 

bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

30 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 
5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex. 
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A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 
5 altering a step parameter that is a thermodynamic variable, including, but not limited to, 
temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
10 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
15 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
20 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the angiogenesis phenotype. 
In onej embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e.g., for diagnostic applications, having identified differentially 
25 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
30 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress an 
angiogenesis expression pattern leading to a normal expression pattern, or to modulate a 
single angiogenesis gene expression profile so as to mimic the expression of the gene from 
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normal tissue, a screen as described above can be performed to identify genes that are 
• specifically modulated in response to the agent. Comparing expression profiles between 
normal tissue and agent treated angiogenesis tissue reveals genes that are not expressed in 
normal tissue or angiogenesis tissue, but are expressed in agent treated tissue. These agent- 
5 specific sequences can be identified and used by methods described herein for angiogenesis 
genes or proteins. In particular these sequences and the proteins they encode find use in 
marking or identifying agent treated cells. In addition, antibodies can be raised against the 
agent induced proteins and used to target novel therapeutics to the treated angiogenesis tissue 
sample. 

10 Thus, in one embodiment, a test compound is administered to a population of 

angiogenic cells, that have an associated angiogenesis expression profile. By 
"administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a maimer as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 

1 5 encoding a proteinaceous candidate agent (i. e. , a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 
the peptide agent is accomplished, e.g., PCTUS97/01019. Regulatable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
20 washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, for example, angiogenesis tissue may be screened for agents that 
modulate, e.g., induce or suppress the angiogenesis phenotype. A change in at least one 
25 gene, preferably many, of the expression profile indicates that the agent has an effect on 
angiogenesis activity. By defining such a signature for the angiogenesis phenotype, screens 
for new drugs that alter the phenotype can be devised. With this approach, the drug target 
need not be known and need not be represented in the original expression screening platform, 
nor does the level of transcript for the target protein need to change. 
30 Measure of angiogenesis polypeptide activity, or of angiogenesis or the 

angiogenic phenotype can be performed using a variety of assays. For example, the effects of 
the test compounds upon the function of the anagiogenesis polypeptides can be measured by 
examining parameters described above. A suitable physiological change that affects activity 
can be used to assess the influence of a test compound on the polypeptides of this invention. 
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When the functional consequences are determined using intact cells or animals, one can also 
measure a variety of effects such as, in the case of angiogenesis associated with tumors, 
tumor growth, neovascularization, hormone release, transcriptional changes to both known 
and uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as 
5 cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In 
the assays of the invention, mammalian angiogenesis polypeptide is typically used, e.g., 
mouse, preferably human. 

A variety of angiogenesis assays are known to those of skill in the art. Various 
models have been employed to evaluate angiogenesis (e.g. , Croix et ah, Science 289: 1 1 97- 

10 1202, 2000 and Kahn et al 9 Amer. J. Pathol 156:1887-1900). Assessement of angiogenesis 
in the presence of a potential modulator of angiogenesis can be performed using cell-cultre- 
based angiogenesis assays, e.g., endothelial cell tube formation assays, as well as other 
bioassays such as the chick CAM assay, the mouse corneal assay, and assays measuring the 
effect of administering potential modulators on implanted tumors. The chick CAM assay is 

15 described by O'Reilly, et al. Cell 79: 315-328, 1994. Briefly, 3 day old chicken embryos with 
intact yolks are separated from the egg and placed in a petri dish. After 3 days of incubation, 
a methylcellulose disc containing the protein to be tested is applied to the CAM of individual 
embryos. After about 48 hours of incubation, the embryos and CAMs are observed to 
determine whether endothelial growth has been inhibited. The mouse corneal assay involves 

20 implanting a growth factor-containing pellet, along with another pellet containing the 

suspected endothelial growth inhibitor, in the cornea of a mouse and observing the pattern of 
capillaries that are elaborated in the cornea. Angiogenesis can also be measured by 
determining the extent of neovascularization of a tumor. For example, carcinoma cells can be 
subcutaneously inoculated into athymic nude mice and tumor growth then monitored. The 

25 . cancer cells are treated with an angiogenesis inhibitor, such as an antibody, or other 

compound that is exogeriously administered, or can be transfected prior to inoculation with a 
polynucleotide inhibitor of angiogenesis. Immunoassays using endothelial cell-specific 
antibodies are typically used to stain for vascularization of tumor and the number of vessels 
in the tumor. 

30 Assays to identify compounds with modulating activity can be performed in 

vitro. For example, an angiogenesis polypeptide is first contacted with a potential modulator 
and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the angiogenesis polypeptide levels are determined in vitro by measuring the level of protein 
or niRNA. The level of protein is measured using immunoassays such as western blotting, 
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ELISA and the like with an antibody that selectively binds to the angiogenesis polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 
hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
5 detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the angiogenesis 
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. After 

10 treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of the 

1 5 expression of the gene or the gene product itself can be done. The gene products of 

differentially expressed genes are sometimes referred to herein as "angiogenesis proteins". In 
preferred embodiments the angiogenesis protein comprises a sequence shown in Table 8. 
The angiogenesis protein may be a fragment, or alternatively, be the full length protein to a 
fragment shown herein. 

20 Preferably, the angiogenesis protein is a fragment of approximately 14 to 24 

amino acids long. More preferably the fragment is a soluble fragment. In one embodiment 
an angiogenesis protein is conjugated or fused to an immunogenic agent or BS A. 

- In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 

25 embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds. are identified, 
variants can be further screened to better evaluate strucutre activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 

30 isolated gene product is used; that is, the gene products of one or more differentially 

expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the angiogenesis proteins can be used in the assays. 
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Thus, in a preferred embodiment, the methods comprise combining an 
angiogenesis protein and a candidate compound, and determining the binding of the 
compound to the angiogenesis protein. Preferred embodiments utilize the human 
angiogenesis protein, although other mammalian proteins may also be used, for example for • 
5 the development of animal models of human disease. In some embodiments, as outlined 
herein, variant or derivative angiogenesis proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the angiogenesis 
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be 

10 made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 

15 teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondiffiisable. Preferred methods of binding include the use of antibodies 

20 (which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
■ crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 
the protein or agent, excess unbound material is removed by washing. The sample receiving 
areas may then be blocked through incubation with bovine serum albumin (BS A), casein or 

25 other innocuous protein or other moiety. 

In a preferred embodiment, the angiogenesis protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
support and the angiogenesis protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 

30 analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including, labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the like. 
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The determination of the binding of the test modulating compound to the 
angiogenesis protein may be done in a number of ways. In a preferred embodiment, the 
compound is labelled, and binding determined directly, e.g., by attaching all or a portion of 
the angiogenesis protein to a solid support, adding a labelled candidate agent (e.g., a 
5 fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

By "labeled" herein is meant that the compound is either directly or indirectly 
labeled with a label which provides a detectable signal, e.g. radioisotope, fluorescers, 
enzyme, antibodies, particles such as magnetic particles, chemiluminescers, or specific 

10 binding molecules, etc. Specific binding molecules include pairs, such as biotin and 
streptavidin, digoxin and antidigoxin, etc. For the specific binding members, the 
complementary member would normally be labeled with a molecule which provides for 
detection, in accordance with known procedures, as outlined above. The label can directly or 
indirectly provide a detectable signal. 

15 In some embodiments, only one of the components is labeled, e.g. , the 

proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., 125 I for the proteinsand a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

20 In one embodiment, the binding of the test compound is determined by 

competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e. an angiogenesis protein), such as an antibody, peptide, binding partner, ligand, 
etc. Under certain circumstances,.there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 

25 embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 
40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput 
screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally 

30 removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to.indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that the test compound is binding 
to the angiogenesis protein and thus is capable of binding to, and potentially modulating, the 
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activity of the angiogenesis protein. In this embodiment, either component can be labeled. 
Thus, for example, if the competitor is labeled, the presence of label in the wash solution 
indicates displacement by the agent Alternatively, if the test compound is labeled, the 
presence of the label on the support indicates displacement. 
5 In an alternative embodiment, the test compound is added first, with 

incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the angiogenesis protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 
support, coupled with a lack of competitor binding, may indicate that the test compound is 

10 capable of binding to the angiogenesis protein. 

In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activitity of the angiogenesis proteins. In 
this embodiment, the methods comprise combining an angiogenesis protein and a competitor 
in a first sample. A second sample comprises a test compound, an angiogenesis protein, and 

15 a competitor. The binding , of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the angiogenesis protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 
agent is capable of binding to the angiogenesis protein. 

20 Alternatively, differential screening is used to identify drug candidates that 

bind to the native angiogenesis protein, but cannot bind to modified angiogenesis proteins. 
The structure of the angiogenesis protein may be modeled, and used in rational drug design to 
. ' . synthesize agents that interact with that site. Drug candidates that affect the activity of an 
angiogenesis protein are also identified by screening drugs for the ability to either enhance or 

25 reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 

30 and the amount of bound, generally labeled agent determined. For example, where a 

radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound. 

A variety of other reagents may be included in the screening assays. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 



65 



WO 02/079492 



PCT/US02/04915 



to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 
5 In a preferred embodiment, the invention provides methods for screening for a 

compound capable of modulating the activity of an angiogenesis protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising angiogenesis 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes an angiogenesis protein. In a preferred embodiment, a library of 
10 candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, for example hormones, antibodies, peptides, 
. antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
15 example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate angiogenesis agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the angiogenesis protein. Once identified, similar structures are evaluated to identify critical 
structural feature of the compound. 
20 In one embodiment, a method of inhibiting angiogenic cell division is 

provided. The method comprises administration of an angiogenesis inhibitor. In another 
- embodiment, a method of inhibiting angiogenesis is provided. The method comprises 
administration of an angiogenesis inhibitor. In a further embodiment, methods of treating 
cells or individuals with angiogenesis are provided. The method comprises administration of 
25 an angiogenesis inhibitor. 

In one embodiment, an angiogenesis inhibitor is an antibody as discussed 
above. In another embodiment, the angiogenesis inhibitor is an antisense molecule. 

Polynucleotide modulators of angiogenesis 
30 Antisense Polynucleotides 

In certain embodiments, the activity of an angiogenesis-associated protein is 
downregulated, or entirely inhibited, by the use of antisense polynucleotide, z.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., an angiogenesis protein mRNA, or a subsequence thereof. 
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Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
ofthemRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 
5 subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the angiogenesis protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

10 . Such antisense polynucleotides can readily be synthesized using recombinant 

means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 
Antisense molecules as used herein include antisense or sense 

15 oligonucleotides. Sense oligonucleotides can, e.g., be employed to block trancription by 
binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for angiogenesis molecules. A preferred antisense 
.molecule is for an angiogenesis sequences in Tables 1-8 , or for a ligand or activator thereof. 

20 Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 
generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, for example, Stein and Cohen (Cancer Res. 48:2659, 
1988) and van der Krol et al. (BioTechniques 6:958, 1988). 

25 

Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of angiogenesis-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 
30 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 

RNase P, and axhead ribozymes (see, e.g., Castanotto et al (1994) Adv. in Pharmacology 25: 
289-3 17 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al. 
(1990) Nucl Acids Res. 18: 299-304; Hampel et al (1990) European Patent Publication No. 0 
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360 257; U.S. Patent No. 5,254,678. Methods of preparing are well known to those of skill in 
the art (see, e.g., Wong-Staal et al, WO 94/26877; Ojwang et al (1993) Proc. Natl. Acad. 
ScL USA 90: 6340-6344; Yamada et al. (1994) Human Gene Therapy 1 : 39-45; Leavitt et al. 
(1995) Proc. Natl. Acad. Sci. USA 92: 699-703; Leavitt et al. (1994) Human Gene Therapy 5: 
5 1151-120; and Yamada ef a/. (1994) Virology 205: 121-126). 

Polynucleotide modulators of angiogenesis may be introduced into a cell . 
containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 

10 bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of 
angiogenesis may be introduced into a cell containing the target nucleic acid sequence, e.g., 

15 by formation of an polynucleotide-lipid complex, as described in WO 90/1 0448. It is 

understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating angiogenesis in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 

20 anti-angiogenesis antibody that reduces or eliminates the biological activity of an 

• endogeneous angiogenesis protein. Alternatively, the methods comprise administering to a 
. cell or organism a recombinant nucleic acid encoding an angiogenesis protein. This may be 
accomplished in any number of ways. In a preferred embodiment, for example when the 
angiogenesis sequence is down-regulated in angiogenesis, such state may be reversed by 

25 increasing the amount of angiogenesis gene product in the cell. This can be accomplished, 
e.g., by overexpressing the endogeneous angiogenesis gene or administering a gene encoding 
the angiogenesis sequence, using known gene-therapy techniques, for example. In a 
preferred embodiment, the gene therapy techniques include the incorporation of the 
exogenous gene using enhanced homologous recombination (EHR), for example as described 

30 in PCT/US93/03868, hereby incorporated by reference in its entireity. Alternatively, for 

example when the angiogenesis sequence is up-regulated in angiogenesis, the activity of the 
endogeneous angiogenesis gene is decreased, for example by the administration of a 
angiogenesis antisense nucleic acid or other inhibitor, such as RNAi. 
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In one embodiment, the angiogenesis eproteins of the present invention may 
be used to generate polyclonal and monoclonal antibodies to angiogenesis proteins. 
Similarly, the angiogenesis proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify angiogenesis 
5 antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 

embodiment, the antibodies are generated to epitopes unique to a angiogenesis protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The angiogenesis 
. antibodies may be coupled to standard affinity chromatography columns and used to purify 
angiogenesis proteins. The antibodies may also be used as blocking polypeptides, as outlined 
10 above, since they will specifically bind to the angiogenesis proterin. 



Methods of identifying variant angiogenesis-associated sequences 

Without being bound by theory, expression of various angiogenesis sequences 
is correlated with angiogenesis. Accordingly, disorders based on mutant or variant 

15 angiogenesis genes may be determined. In one embodiment, the invention provides methods 
for identifying cells containing variant angiogenesis genes, e.g., determining all or part of the 
sequence of at least one endogeneous angiogenesis genes in a cell. This may be 
accomplished using any number of sequencing techniques. In a preferred embodiment, the 
invention provides methods of identifying the angiogenesis genotype of an individual, e.g., 

20 determining all or part of the sequence of at least one angiogenesis gene of the individual. 
This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced angiogenesis gene to a known angiogenesis gene, 
i.e., a wild-type gene. 

25 . The sequence of all or part of the angiogenesis gene can then be compared to 

the sequence of a known angiogenesis gene to determine if any differences exist. This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 
embodiment, the presence of a a difference in the sequence between the angiogenesis gene of 
the patient and the known angiogenesis gene correlates with a disease state or a propensity 

30 for a disease state, as outlined herein. 

In a preferred embodiment, the angiogenesis genes are used as probes to 
determine the number of copies of the angiogenesis gene in the genome. 

In another preferred embodiment, the angiogenesis genes are used as probes to 
determine the chromosomal localization of the angiogenesis genes. Information such as 
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chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the 
angiogenesis gene locus. 

5 Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of an angiogenesis protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 

10 known techniques (e.g., Ansel et al. 9 Pharmaceuitcal Dosage Forms and Drug Delivery, 
Lippincott, Williams & Wilkins Publishers, ISBN:0683305727; Lieberman (1992) 
Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846, 08247691 8X, 
0824712692, 0824716981; Lloyd (1999) The Art, Science and Technology of Pharmaceutical 
Compounding, Amer. Pharmacutical Assn, ISBN 0917330889; and Pickar (1999) Dosage 

1 5 Calculations, Delmar Pub, ISBN 0766805042). As is known in the art, adjustments for 
angiogenesis degradation, systemic versus localized delivery, and rate of new protease 
synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
drug interaction and the severity of the condition may be necessary, and will be ascertainable 
with routine experimentation by those skilled in the art. 

20 A "patient" for the purposes of the present invention includes both humans and 

other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
preferably a primate, and in the most preferred embodiment the patient is human. 

The administration of the angiogenesis proteins and modulators thereof of the 

25 present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
some instances, for example, in the treatment of wounds and inflammation, the angiogenesis 
proteins and modulators may be directly applied as a solution or spray. 

30 The pharmaceutical compositions of the present invention comprise an 

angiogenesis protein in a form suitable for administration to a patient. In the preferred 
embodiment, the pharmaceutical compositions are in a water soluble form, such as being 
present as pharmaceutical^ acceptable salts, which is meant to include both acid and base 
addition salts. "Pharmaceutical^ acceptable acid addition salt" refers to those salts that retain 
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the biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
5 acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 

methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

10 potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamide, triethylamine, 
tripropylamine, and ethanolamine. 

15 The pharmaceutical compositions may also include one or more of the 

following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit 

20 dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that angiogenesis protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 

25 molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise an angiogenesis 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 

30 carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
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and the like, for example, sodium acetate, sodium chloride, potassium chloride, calcium 
chloride, sodium lactate and the like. The concentration of active agent in these formulations 
can vary widely, and will be selected primarily based on fluid volumes, viscosities, body 
weight and the like in accordance with the particular mode of administration selected and the 
5 patient's needs (e.g. , Remington 's Pharmaceutical Science, 1 5th ed., Mack Publishing 
Company, Easton, Pennsylvania (1980) and Goodman and Gillman, The Pharmacologial 
Basis ofTherapeuticsffidrdmzn, J.G, Limbird, L.E, Molinoff, P.B., Ruddon, R.W, and 
Gilman, A.G.,eds) TheMcGraw-Hill Companies, Inc., 1 996). 

Thus, a typical pharmaceutical composition for intravenous administration 

10 would be about 0.1 to 10 mg per patient per day. Dosages from 0. 1 up to about 100 mg per 
. patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 

15 in the art, e.g.. Remington 's Pharmaceutical Science and Goodman and Gillman, The 
Pharmacologial Basis of Therapeutics, supra. 

The compositions containing modulators of angiogenesis proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 

20 amount sufficient to cure or at least partially arrest the disease and its complications. An 

amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 

25 the composition should provide a sufficient quantity of the agents of this invention to 

effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophylactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 

30 factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 
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It will be appreciated that the present angiogenesis protein-modulating 
compounds can be administered alone or in combination with additional angiogenesis 
modulating compounds or with other therapeutic agent, e.g., other anti-cancer agents or 
treatments. 

5 In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 

comprising nucleic acid sequences set forth in Tables 1-8 , such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of angiogenesis-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 

10 organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 

15 plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell {see, 
e.g., Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 Academic Press, Inc., San Diego, CA (Berger), F.M. Ausubel et al, eds., Current 
Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & 

20 Sons, Inc., (supplemented through 1999), and Sambrook et al, Molecular Cloning - A 
Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989. 

In a preferred embodiment, angiogenesis proteins and modulators are 
administered as therapeutic agents, and can be formulated as outlined above. Similarly, 

25 angiogenesis genes (including both the full-length sequence, partial sequences, or regulatory 
sequences of the angiogenesis coding regions) can be administered in a gene therapy 
application. These angiogenesis genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

30 Angiogenesis polypeptides and polynucleotides can also be administered as 

vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 
compositions can include, for example, lipidated peptides (e.#.,Vitiello, A. et al, J. Clin. 
Invest 95:341, 1995), peptide compositions encapsulated in poly (DL-lactide-co-glycolide) 
("PLG") microspheres {see, e.g., Eldridge, etal.,Molec. Immunol 28:287-294, 1991: Alonso 
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etal, Vaccine 12:299-306, 1994; Jones etal, Vaccine 13:675-681, 1995), peptide 
compositions contained in immune stimulating complexes (ISCOMS) {see, e.g., Takahashi et 
al, Nature 344:873-875, 1990; Hue* a/., Clin Exp Immunol 113:235-243, 1998), multiple 
antigen peptide systems (MAPs) (see e.g., Tarn, J. P., Proc. Natl Acad. ScL U.S.A. 85:5409- 
5 5413, 1988; Tam, J.P., J. Immunol Methods 196:17-32, 1996), peptides formulated as 
multivalent peptides; peptides for use in ballistic delivery systems, typically crystallized 
peptides, viral delivery vectors (Perkus, M. E. et al, In: Concepts in vaccine development, 
Kaufinann, S. H. E., ed., p. 379, 1996; Chakrabarti, S. etal, Nature 320:535, 1986; Hu, S. L. 
et al, Nature 320:537, 1986; Kieny, M.-P. et al, AIDS Bio/Technology 4:790, 1986; Top, F. 

10 H. et al, J. Infect. Dis. 124:148, 1971; Chanda, P. K. et al, Virology 175:535, 1990), 
particles of viral or synthetic origin {e.g., Kofler, N. et al,J. Immunol Methods. 192:25, 
1996; Eldridge, J. H. et al, Sem. Hematol 30:16, 1993; Falo, L. D., Jr. et al, Nature Med. 
7:649, 1995), adjuvants (Warren, H. S., Vogel, F. R., and Chedid, L. A. Annu. Rev. Immunol 
4:369, 1986; Gupta, R. K. et al, Vaccine 11:293, 1993), liposomes (Reddy, R. et al,J. 

15 Immunol. 148:1585, 1992; Rock, K. L., Immunol Today 17:131, 1996), or, naked or particle 
absorbed cDNA (Ulmer, J. B. et al, Science 259:1745, 1993; Robinson, H. L., Hunt, L. A., 
and Webster, R. G., Vaccine 1 1:957, 1993; Shiver, J. W. et al, In: Concepts in vaccine 
development, Kaufinann, S. H. E., ed., p. 423, 1996; Cease, K. B., and Berzofsky, J. A., 
Annu. Rev. Immunol 12:923, 1994 and Eldridge, J. H. et al, Sem. Hematol 30:16, 1993). 

20 Toxin-targeted delivery technologies, also known as receptor mediated targeting, such as 
those of Avant Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 
or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 

25 Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, 
Detroit, MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 
(SmithKline Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel 
(alum) or aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of 

30 acylated tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; . 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 
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Vaccines can be administered as nucleic acid compositions wherein DNA or 
RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff et aL, Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
5 WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
(see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 

1 0 invention can be expressed by viral or bacterial vectors. Examples of expression vectors 

include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, for example, as a vector to express nucleotide sequences that encode 
angiogenic polypeptides or polypeptide fragments. Upon introduction into a host, the 
recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits an 

1 5 immune response. Vaccinia vectors and methods useful in immunization protocols are 
described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Calmette 
Guerin). BCG vectors are described in Stover et al, Nature 351:456-460 (1991). A wide 
variety of other vectors useful for therapeutic administration or immunization e.g. adeno and 
adeno-associated virus vectors, retroviral vectors, Salmonella typhi vectors, detoxified 

20 anthrax toxin vectors, and the like, will be apparent to those skilled in the art from the 

description herein (see, e.g., Shata et al (2000) Mol Med Today, 6: 66-71; Shedlock et al z J 
Leukoc Biol 68,:793-806, 2000; Hipp et al, In Vivo 14:571r85, 2000). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing an angiogenesis gene or portion of an angiogenesis gene under the control of a 

25 regulatable promoter or a tissue-specific promoter for expression in an angiogenesis patient. 
The angiogenesis gene used for DNA vaccines can encode full-length angiogenesis proteins, 
but more preferably encodes portions of the angiogenesis proteins including peptides derived 
from the angiogenesis protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from an angiogenesis gene. 

30 For example, angiogenesis-associated genes or sequence encoding subfragments of an 
angiogenesis protein are introduced into expression vectors and tested for their 
immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell 
responses. This procedure provides for production of cytotoxic T cell responses against ceils 
which present antigen, including intracellular epitopes. 
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In a preferred embodiment, the DNA vaccines include a gene encoding an 
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the angiogenesis polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment angiogenesis genes find use in generating 
animal models of angiogenesis. When the angiogenesis gene identified is repressed or 
diminished in angiogenesic tissue, gene therapy technology, e.g., wherein antisense RNA 
directed to the angiogenesis gene will also diminish or repress expression of the gene. 
Animal models of angiogenesis find use in screening for modulators of an angiogenesis- 
associated sequence or modulators of angiogenesis. Similarly, transgenic animal technology 
including gene knockout technology, for example as a result of homologous recombination 
with an appropriate gene targeting vector, will result in the absence or increased expression 
of the angiogenesis protein. When desired, tissue-specific expression or knockout of the 
angiogenesis protein may be necessary. 

It is also possible that the angiogenesis protein is overexpressed in 
angiogenesis. As such, transgenic animals can be generated that overexpress the 
angiogenesis protein. Depending on the desired expression level, promoters of various 
strengths can be employed to express the transgene. Also, the number of copies of the 
integrated transgene can be determined and compared for a determination of the expression 
level of the transgene. Animals generated by such methods find use as animal models of 
angiogenesis and are additionally useful in screening for modulators to treat angiogenesis or 
to evaluate a therapeutic entity. - 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, angiogenesis-specific nucleic 
acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 
ribozymes, dominant negative angiogenesis polypeptides or polynucleotides, small molecules 
inhibitors of angiogenesis-associated sequences etc. A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 
(/.&, protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
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medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
like. Such media may include addresses to internet sites that provide such instructional 
5 materials. 

The present invention also provides for kits for screening for modulators of 
angiogenesis-associated sequences. Such kits can be prepared from readily available 
materials and reagents. For example, such kits can comprise one or more of the following 
materials: an angiogenesis-associated polypeptide or polynucleotide, reaction tubes, and 

10 instructions for testing angiogenic-associated activity. Optionally, the kit contains 

biologically active angio genesis protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 

1 5 parameters in disease which may be identified in historical or outcome data. 

It is understood that the examples described above in no way serve to limit 
the true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
20 specification are herein incorporated by reference as if each individual publication or patent 
- application were specifically and individually indicated to be incorporated by reference. 

EXAMPLES 

25 Example 1: Tissue Preparation. Labeling Chips, and Fingerprints 
Purify total RNA from tissue using TRIzol Reagent 

Homogenize tissue samples in 1ml of TRIzol per 50mg of tissue using a 
Polytron3100homogenizer. The generator/probe used depends upon the tissue size, A 
generator that is too large for the amount of tissue to be homogenized will cause a loss of 

30 sample and lower RNA yield. TRIzol is added directly to frozen tissue, which is then 

homogenize. Following homogenization, insoluble material is removed by centrifiigation at 
7500 x g for 15 min in a Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf 
centrifuge at 4°C. The clear homogenate is transferred to a new tube for use. The samples 
may be frozen now at -60° to -70°C (and kept for at least one month). The homogenate is 
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mixed with 0.2ml of chloroform per 1ml of TRIzol reagent used in the original 
homogenization and incubated at room temp, for 2-3 minutes. The aqueous phase is then 
separated by centrifugation and transferred to a fresh tube and the RNA precipitated using 
isopropyl alcohol. The pellet is isolated by centrifugation, washed, air-dried, resuspended in 
5 an appropriate volume of DEPC H 2 0, and the absorbance measured. 

Purification of poly A+ mRNA from total RNA is performed as follows. Heat 
an oligotex suspension to 37°C and mixing immediately before adding to RNA. The 
Elution Buffer is heated at 70°C. Warm up 2 x Binding Buffer at 65°C if there is precipitate 
in the buffer. Mix total RNA with DEPC-treated water, 2 x Binding Buffer, and Oligotex 

10 according to Table 2 on page 16 of the Oligotex Handbook. Incubate for 3 minutes at 65°C. 
Incubate for 10 minutes at room temperature. Centrifuge for 2 minutes at 14,000 to 18,000 g. 
Remove supernatant without disturbing Oligotex pellet. A little bit of solution can be left 
behind to reduce the loss of Oligotex. Gently resuspend in Wash Buffer OW2 and pipet onto 
spin column. Centrifuge the spin column at full speed for 1 minute. Transfer spin column to 

15 a new collection tube and gently resuspend in Wash Buffer OW2 and centrifuge as describe 
herein. Transfer spin column to a new tube and elute with 20 to 100 ul of preheated (70oC) 
Elution Buffer. Gently resuspend Oligotex resin by pipetting up and down. Centrifuge as 
above. Repeat elution with fresh elution buffer or use first eluate to keep the elution volume 
low. Read absorbance, using diluted Elution Buffer as the blank. Before proceeding with 

20 cDNA synthesis, precipitate the mRNA as follows: add 0.4 vol. of 7.5 M NH40Ac + 2.5 vol. 
of cold 100% ethanoL Precipitate at -20oC 1 hour to overnight (or 20-30 min. at -70oC). 
Centrifuge at 14,000-16,000 x g for 30 minutes at 4oC. Wash pellet with 0.5ml of 
80%ethanol (-20oC) then centrifuge at 14,000-16,000 x g for 5 minutes at room temperature.. 
Repeat 80% ethanol wash. Air dry the ethanol from the pellet in the hood.. Suspend pellet in 

25 DEPC H 2 0 at lug/ul concentration. 

To further Clean up total RNA using Qiagen's RNeasy kit, add no more than 
lOOug to an RNeasy column. Adjust sample to a volume of lOOul with RNase-free water. 
Add 350ul Buffer RLT then 250ul ethanol (100%) to the sample. Mix by pipetting (do not 
centrifuge) then apply sample to an RNeasy mini spin column. Centrifuge for 15 sec at 

30 >l0,000rpm. Transfer column to a new 2-ml collection tube. Add 500ul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough. Add 500ul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough then centrifuge for 2 min at 
maximum speed to dry column membrane. Transfer column to a new 1 >5-ml collection tube 



78 



WO 02/079492 



PCT/US02/04915 



and apply 30-50ul of RNase-free water directly onto column membrane. Centrifuge 1 min at 
>10,000rpm. Repeat elution. and read absorbance. 

cDNA synthesis using Gibco's "Superscript Choice System for cDNA Synthesis" kit 
5 First Strand cDNA synthesis is performed as follows. Use 5ug of total RNA 

or lug of poly A+ mRNA as starting material. For total RNA, use 2ul of Superscript RT. For 
polyA+ mRNA, use lul of Superscript RT. Final volume of first strand synthesis mix is 
20ul. RNA must be in a volume no greater than lOul. Incubate RNA with lul of lOOpmol 
T7-T24 oligo for 1 0 min at 70C. On ice* add 7 ul of: 4ul 5X 1st Strand Buffer, 2ul of 0. 1M 

10 DTT, and 1 ul of lOmM dNTP mix. Incubate at 37C for 2 min then add Superscript RT. 
Incubate at 37C for 1 hour. 

For the second strand synthesis, place 1st strand reactions on ice and add: 91ul 
DEPC H 2 0; 30ul 5X 2nd Strand Buffer; 3ul lOmM dNTP mix; lul lOU/ul E.coli DNA 
Ligase; 4ul lOU/ul E.coli DNA Polymerase; and lul 2U/ul RNase H. Mix and incubate 2 

15 hours at 16C. Add 2ul T4 DNA Polymerase. Incubate 5 min at 16C. Add lOul of 0.5M 
EDTA. A further clean-up of DNA is performed using phenol:chloroform:isoamyl Alcohol 
(25:24:1) purification. 

In vitro Transcription (TVT) and labeling with biotin is performed as follows: 
Pipet 1 .5ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2ul T7 

20 lOxATP (75mM) (Ambion); 2ul T7 lOxGTP (75mM) (Ambion); 1.5ul T7 lOxCTP (75mM) 
(Ambion); 1.5ul T7 lOxUTP (75mM) (Ambion); 3.75ul lOmM Bio-ll-UTP (Boehringer- 
. Mannheim/Roche or Enzo); 3.75ul lOmM Bio-16-CTP (Enzo); 2ul lOx T7 transcription 
buffer (Ambion); and 2ul lOx T7 enzyme mix (Ambion). The final volume is 20ul. Incubate 
6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 

25 Fragmentation is performed as follows. 15 ug of labeled RNA is usually 

fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 
the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 

30 buffer is 200 mM Tris-acetate, pH 8. 1 ; 500 mM KOAc; 1 50 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
of the transcript size range 
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For hybridization, 200 ul (10ug cRNA) of a hybridization mix is puton the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (50ng/ul final cone); 50 pM 948-b control 
5 oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; O.lmg/ml herring sperm DNA; 
0.5mg/ml acetylated BSA; and 300 ul with IxMES hyb buffer. 

Labeling is performed as follows: The hybridization reaction includes non- 
biotinylated IVT (purified by RNeasy columns); IVT antisense RNA 4 |ig:jal; random 
Hexamers (1 fig/jxl) 4 \il and water to 14 ul. The reaciton is incubated<at 70°C, 10 min. 

10 Reverse transcriptionis performed in the following reaction: 5X First Strand (BRL) buffer, 6 
lil; 0.1 MDTT, 3 50X dNTP mix, 0.6 |d; H 2 G, 2.4 Cy3 or Cy5 dUTP (ImM), 3 jul; SS 
RT II (BRL), 1 jil in a final volume of 16 \il Add to hybridization reaction. Incubate 30 
min., 42°C. Add 1 |il SSII and incubate another hour. Put on ice. 50X dNTP mix (25mM of 
cold dATP, dCTP, and dGTP, lOmM of dTTP: 25 \xl each of lOOmM dATP, dCTP, and 

15 dGTP; 10 |al of lOOmM dTTP to 15 \il H20. dNTPs from Pharmacia) 

RNA degradation is performed as follows. Add 86 ^1 H2Q, 1.5 jil 1M NaOH/ 
2mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 jxl TE/sample spin at 7000g 
for 1 0 min, save flow through for purification. For Qiagen purification, suspend u-con 
recovered material in 500jxl buffer PB and proceed using Qiagen protocol. For DNAse 

20 digestion, add 1 ul of 1/100 dil of DNAse/30ul Rx and incubate at 37°C for 15 min. Incubate 
at 5 min 95°C to denature the DNAse/ 

For sample preparation, add Cot-1 DNA, 10 50X dNTPs, 1 |xl; 20X SSC, 
2.3 jil; Na pyro phosphate, 7.5 jllI; lOmg/ml Herring sperm DNA; lul of 1710 dilution to 21.8 
final vol. Dry in speed vac. Resuspendin 15 \xlE20. Add 0.38 \x\ 10% SDS. Heat95°C,2 

25 min and slow cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. 
Washing after the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 mis 20X SSC+0.75mls 
10% SDS in 250mls H20; IX SSC: 5 min., 12.5 mis 20X SSC in 250mis H20; 0.2X SSC: 5 
min., 2.5 mis 20X SSC in 250mls H20. Dry slides and scan at appropiate PMT's and 
channels. 

30 

Example 2. A model of angiogenesis is used to determine expression in angiogenesis 

In the model of angiogenesis used to determine expression of angiogenesis- 
associated sequences, human umbilical vein endothelial cells (HUVEC) were obtained, e.g., 
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as passage 1 (pi) frozen cells from Cascade Biologies (Oregon) and grown in maintenance 
medium: Medium 199 (Life Technologies) supplemented with 20% pooled human serum, 
100 mg/ml heparin and 75 mg/ml endothelial cell growth supplements (Sigma) and 
gentamicin (Life Technologies). An in vitro cell system model was used in which 2x1 0 5 
5 HUVECs were cultured in 0.5 ml 3 mgs/ml plasminogen-depleted fibrinogen (Calbiochem, 
San Diego, CA) that was polymerized by the addition of 1 unit of maintenance medium 
supplemented with 100 ng/ml VEGF and HGF and 10 ng/ml TGF-a (R&D Systems, 
Minneapolis,MN) added (growth medium). The growth medium was replaced every 2 days. 
Samples for RNA were collected, e.g., at 0, 2, 6, 15, 24, 48, and 96 hours of culture. The 

1 0 fibrin clots were placed in Trizol (Life Technologies) and disrupted using a Tissuemizer. 
Thereafter standard procedures were used for extracting the RNA (e.g., Example 1). 

Angiogenesis associated sequences thus identified are shown in Tables 1-8 . 
As indicated, some of the Accession numbers include expression sequence tags (ESTs). 
Thus, in one embodiment herein, genes within an expression profile, also termed expression 

1 5 profile genes, include ESTs and are not necessarily full length. 
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TABLE 1: 

Pkey: Unique Eos probeset identifier number 

Accession: Accession number used for previous patent filings 
ExAccn: Exempiar Accession number, Genbank accession number 
UnigenelD: Untgene number 
Unigene Title: Unigene gene title 
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II- 4 0»CA-7 

HS.lWb^/ 


100294 D49396 


AA331881 


II. 7C/ C/ 

MS. 70404 


100327 D55640 


D55640 




100335 D63391 


AW247529 


Ue R7Q1 


134495 D63477 


D63477 




100338 D63483 


D86864 


Hs.57735 


135152 D64015 


M96954 


Hs.182741 


134269. D79990 


NIVL014737 


Hs.80905 


100372 D79997 


NiVL014791 


Hs.184339 


134304 D80010 


BE613486 


Hs.81412 


100394 D84276 


D84284 


Hs.66052 


100405 D86425 


AW291587 


Hs.82733 


100418 D86978 


D86978 


Hs.84790 


133154 D87012 


D87012 


Hs.194685 


134347 D87075 


AF164142 


Hs.82042 


128653 D87432 


D87432 


Hs.10315 


100438 D87448 


AA013O51 


Hs.91417 


134593 D87845 


NJVL000437 


Hs.234392 


100481 HG1Q98-HT1098 


X70377 


Hs.121489 


100552 HG2167-HT2237 AA019521 


Hs.301946 


100591 HG2415-HT2511 NM_004091 


Hs.231444 


cds 






100652 HG2825-HT2949 BE613608 


Hs.142653 



100662 HG2887-HT3031_r 
100899 HG4660W5073 AL039123 
100905 HG4704-HT5146 L12260 
100945 HG884-HT884 AF002225 
100950 HG919-HT919 AF128542 
100964 J00212J J00212 
135407 J04029 J04029 
130149 J04031 AW067805 
131877 J04088 J04088 
101016 J04543 J04543 
134786 L06139 T29618 
134100 L07540 AA460085 
134078 L08895 L08895 
101132 L11239 L11239 
134849 L11353 BE409525 
106432 L13773 AK000310 



Hs.103042 
Hs.172816 
Hs.180686 
Hs.166846 

Hs.99936 

Hs.172665 

Hs.156346 

HS.7B637 

Hs.89640 

Hs.171075 

Hs.78995 

Hs.36993 

Hs.902 

Hs.17138 



vaccinia related kinase 2 

Rho guanine exchange factor (GEF) 12 

proteasome (prosome, macropain) 26S subunit, non-ATPase, 12 

tousled-like kinase 2 

homogentisate 1,2-dioxygenase (homogentisate oxidase) 
syntaxin16 

similar to S. cerevislae SSM4 
KIAA1605 protein 
chromosome condensation 1 

v-rai simian leukemia viral oncogene homolog B (ras related; GTP binding protein) 
tissue inhibitor of metalloproteinase 1 (erythroid potentiating activity, coiiagenase inhibitor) 
KIAA01 01 gene product 
D123 gene product 

mannosyl (alpha-1,6-)-glycoprotein beta-1,6-N-acetyi-glucosaminyttransferase 
RAD23 (S. cerevisiae) homolog B 
diacylgiycerol kinase, gamma (90kD) 

DEAD7H (Asp-Glu-Ala^sp/His) box polypeptide 7 (RNA helicase, 52kD) 
caicium/calmodulin-dependent protein kinase IV 
TRAM-ilke protein 
KIAA0061 protein 
KIAA0071 protein 

prostaglandin 12 (prostacyclin) receptor (JP) 
postmeiotic segregation increased 2-iike 4 
RAD21 (S. pombe) homolog 
KIAA01 18 protein 
peroxiredoxin 3 

gb:Human monocyte PABL (pseudoautosomal boundary-like sequence) mRNA, clone Mo2. 
platelet-activating factor acetylhydrolase, isoform lb, gamma subunit (29kD) 
KIAA0143 protein 
acetyl LDL receptor, SREC 

TIA1 cytotoxic granule-associated RNA-binding protein-iike 1 

Ras association (RalGDS/AF-6) domain family 2 . 

WAA0175 gene product _ 

lipinl 

CD38 antigen (p45) 
nidogen2 ... 
KIAA0225 protein 
topdsomerase (DNA) III beta 

solute carrier family 23 (nucleobase transporters), member 1 ■ 

solute carrier family 7 (cationic amino acid transporter, y+ system), member 6 

topoisomerase (DNA) II binding protein 

platelet-activating factor acetylhydrolase 2 (40kD) 

cystatin D 

lysosomal 

Homo sapiens, Similar to hypothetical protein PR01722, clone MGC:156?2, mRNA, complete 
ret finger protein 

Hs.816 SRY (sex determining region Y)-box 2 
microtubule-assodated protein 1B 
neureguRn 1 

ubtquitin protein Bgase E3A (human papilloma vims E6-associated protein, Angelman syndrome) 

polymerase (DMA directed), epsSon 

Empirically selected from AFFX single probeset 

keratin 10 (epidermolytic hyperkeratosis; keratosis palmaris et ptantaris) 

methylenetetrahydrofolate dehydrogenase (NADP+ dependent), methenyitetrahydrofolate 

topoisomerase (DNA) il alpha (170kD) 

annexinA7 

TEK tyrosine kinase, endothelial (venous malformations, multiple cutaneous and mucosal) 
replication factor C (activator 1) 5 (36.5kD) 

MADS box transcription enhancer factor 2, polypeptide C (myocyte enhancer factor 2C) 
gastrulation brain homeo box 1 
neurofibromin 2 (bilateral acoustic neuroma) 
hypothetical protein FU20303 
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101152 L13600 


AI984625 


Hs.9884 


135397 114922 


L14922 


Hs.165563 


131687 L15189 


BE297635 


Hs.3069 


101168 L153B8 


NM.005308 


Hs.211569 


421155 L16895 


H87879 


Hs.102267 


101226 L27476 


AF083892 


Hs.75608 


133975 L27624 


C18356 


Hs.295944. 


134739 L32976 


NWL002419 


Hs.69449 


130155 133404 


AA101043 


Hs.151254 


440538 135263 


W76332 


Hs.79107 


132813 L37347 


BE313625 


Hs.57435 


101294 L40371 


AF168418 


Hs.1 16784 


101300 L40391 


BE535511 


Hs.74137 


101310 L41607 


L41607 


Hs.934 


130344 177566 


AW250122 


Hs.154879 


embryonic lethal 






101381 M13928 


AW675039 


Hs.1227 


101668 M14016 


AW005903 


Hs.78601 


133780 M14219 


AA557660 


Hs.76152 


101396 M15796 


BE267931 


Hs.78996 


101447 M21305 


M21305 




101458 M22092 


M22092 




101470 M22898 


NNL000546 


Hs.1846 


134604 M22995 


NM 002884 


Hs.865 


101478 M23379 


NIVL002890 


Hs.758 


406698 M24364 


X03068 


Hs.73931 


133519 M24400 


AW583062 


Hs.74502 


131185 M25753 


BE280074 


Hs.23960 


134116 M27691 


R84694 


Hs.79194 


133999 M28213 


AA535244 


Hs.78305 


130174 M29550 


M29551 


Hs.151531 


129963 M29971 


M29971 


Hs.1384 . 


132983 M30269 


M30269 


Hs.62041 


133900 M31158 


M31158 


Hs.77439. 


101543 M31166 


M31166 


Hs.2050 


101545 M3121Q 


BE246154 . 


Hs.154210 


101620 M55420 


S55271 


Hs.247930 


134691 M59979 


AW382987 


Hs.88474 


133595 M52810 


AA393273 


Hs.75133 


130425 M53838 


AA243383 


Hs.155530 


101700 M64710 


D90337 


Hs.247916 


101714 M88874 


M68874 


Hs.211587 


134246 M74524 


D28459 


Hs.80612 


101760 M80254 


M80254 


Hs.173125 


133948 M81760_cds3 


X59960 


HsJ7813 


101791 MB3822 


M83822 


Hs.62354 


101812 M86934 


BE439394 


Hs.78991 


101813 M87338 


NMJJ02914 


Hs.139226 


133396 M96326_ma1 


M96326 


Hs.72885 


135152 M96954 


M96954 


Hs.182741 


129026 M98833 


AL120297 • 


Hs.108043 


101901 S66793 


H38026 . 


Hs.308 


134831 S72370 


AA853479 


Hs.89890 


134039 S78569 


NM 002290 


Hs.78672 


134395 S79873 


AA456539 


Hs.8262 


101975 S83325 


AA079717 


Hs.283664 


101977 S83364 


AF112213 


Hs.184062 


101978 S83365 


BE561610 


Hs.5809 


interacting factor) 




Hs.248153 


101998 U01212 


U01212 


102003 U01922 


U01922 


Hs.1 25565 


102007 U02556 


U02556 


HS75307 


102009 U02680 


BE245149 


Hs.82643 


416658 U03272 


U03272 


Hs.79432 


132951 U04209 


AW821182 


Hs.61418 


135389 U05237 


U05237 


Hs.99872 


102048 U07225 


U07225 


Hs.339 


130145 U07620 


U34820 


Hs.151051 


303153 U09759 


U09759 


Hs.246857 


420269 U09820 


U72937 


Hs.96264 


102095 U11313 


U11313 


Hs.75760 


102123 U14518 


NWL001809 


Hs.1594 


102126 U14575 


AW950870 


Hs.78961 


102133 U15173 


AU076845 


Hs.155596 


102139 U15932 


NNL004419 


Hs.2128 


102162 U18291 


AA450274 


Hs.1592 



spindle pole body protein 
replication factor C (activator 1 ) 1 (1 45kD) 
heat shock 70kD protein 9B (morteIin-2) 
G protein-coupled receptor kinase 5 
lysyl oxidase 

tight junction protein 2 (zona ocdudens 2) 
tissue factor pathway Inhibitor 2 
mitogen-activated protein kinase kinase kinase 1 1 
kallikrein 7 (chymotryptic, stratum comeum) 
mitogen-activated protein kinase 14 

solute carrier family 1 1 (proton-coupled divalent metal Ion transporters), member 2 

thyroid hormone receptor interactor 4 

transmembrane trafficking protein 

glucosamlnyl (N-acetyl) transferase 2. l-branching enzyme 

DIGeorge syndrome critical region gene DGSI; likely ortholog of mouse expressed sequence 2 

aminolevulinate, delta-, dehydratase 
uroporphyrinogen decarboxylase 
decorin 

proliferating cell nuclear antigen 

gb:Human alpha satellite and satellite 3 junction DNA sequence. 

gb:Human neural cell adhesion molecule (N-CAM) gene, exon SEC and partial cds. 

tumor protein p53 (U-Fraumeni syndrome) 

RAP1A, member of RAS oncogene family 

RAS p21 protein activator (GTPase activating protein) 1 

major histocompatibility complex, dass II, DQ beta 1 

chymotrypsinogen B1 

cyclin B1 

cAMP responsive element binding protein 1 
RAB2, member RAS oncogene family 

protein phosphatase 3 (formerly 2B), catalytic subunit, beta isoform (caicineuiin A bete) 
O-6-methytguanine-DNA methyitransferase 
nidogen (enactin) 

protein kinase, cAMP-dependent, regulatory, type II, beta 
pentaxin-reiated gene, rapidly induced by M beta 
endothelial differentiation, sphingolipid G-protein-coupled receptor, 1 
Epsilon , IgE 

prostaglandin-endoperoxide synthase 1 (prostaglandin G/H synthase and cyclooxygenase) 

transcription factor 6-like 1 (mitochondrial transcription factor Mike) 

interferon, gamma -inducible protein 16 

natriuretic peptide precursor C . 

phospholipase A2, group IVA (cytosolic, calcium-dependent) 

ubiquitin-conjugating enzyme E2A (RAD6 homolog) 

peptidyiprolyl isomerase F (cyclophilin F) 

sphingomyelin phosphodiesterase 1, acid lysosomal (acid sphingomyelinase) 
cell division cycle 4-like 

DNA segment, numerous copies, expressed probes (GS1 gene) . : 

repycation factor C (activator 1) 2 (40kD) 

azurocidin 1 (cationic antimicrobial protein 37) 

TIA1 cytotoxic granule-associated RNA-blnding protein-like 1 

Friend leukemia virus integration 1 

arrestin 3, retinal (X-amestin) 

pyruvate carboxylase 

Iaminin l aipha4 

lysosomal 

aspartate beta-hydroxytase 
putative RabS-interacting protein 

putative transmembrane protein; homolog of yeast Golgi membrane protein Yiflp (Ylplp- 
olfactory marker proteki 

translocase of Inner mitochondrial membrane 8 (yeast) homolog A 
t-complex-assoclated-testis-expressed l-like 
protein tyrosine kinase 9 

fibrillin 2 (congenital contractural arachnodactyly) 

mlcrofibrillar-associated protein 1 

fetal Alzheimer antigen 

purinerglc receptor P2Y, G-protein coupled, 2 

mitogen-activated protein kinase 10 

mitogen-activated protein kinase 9 

alpha thalassemia/mental retardation syndrome X-linked (RAD54 (S. cerevisiae) homolog)- 

sterol carrier protein 2 

centromere protein A (17kD) 

protein phosphatase 1 , regulatory (fohibitor) subunit 8 

BCL2/adenovirus E1B 19kD-interacting protein 2 

dual specificity phosphatase 5 

CDC16 (cell division cycle 16, S. cerevisiae, homolog) 
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102164 


U18300 


NM 000107 


Hs.77602 


damage-specific DNA binding protein 2 (48kD) 


427653 


U18383 


AA159001 


Hs.180069 


nuclear respiratory factor 1 


131817 


U20536 


U20536 


Hs.3280 


caspase 6, apoptosis-related cysteine protease 


102200 


U21551 


AA2323B2 


Hs.1 57205 


branched chain aminotransferase 1, cytosolic 


102210 


U23028 


BE619413 


Hs.2437 


eukaryctic translation initiation factor 2B, subunit 5 (epsiion, 82kD) 


102214 


U23752 


U23752 


Hs.32964 


SRY (sex determining region Y>box 1 1 


132811 


U25435 


U25435 


Hs.57419 


CCCTC-binding factor (zinc finger protein) 


131319 


U25997 


NM.003155 


Hs.25590 


stanniocaJdn 1 


102256 


U28251_cds2 


U28251 


Hs.53237 


ESTs, Highly similar to Z169J4UMAN ZINC FINGER PROTEIN 169 [Haptens] 


132316 


U28831 


U28831 


Hs.44566 


KIAA1641 protein 


102269 


U30245 


U30245 




gb:Human myelomonocytic specific protein (MNDA) gene, 5' flanking sequence and complete 


exon 1. 
134365 


U32315 


AA568906 


Hs.82240 


syntaxh 3A 


102293 


U32439 


AF090116 


Hs.79348 


regulator of G-protein signalling 7 


102298 


U32849 


AA382169 


Hs.54483 


N-myc (and STAT) interactor 


102325 


U35139 


AI815867 


Hs.50130 


necdin (mouse) homolog 


302344 


U36764 


BE303044 


Hs.192023 


eukaryotic translation initiation factor 3, subunit 2 (beta, 36kD) 


102361 


U39400 


AA223616 


Hs.75859 


chromosome 11 open reading frame 4 


102367 


U39657 


U39656 


Hs.118825 


mltogen-activaied protein kinase kinase 6 


102388 


U41344 


AA362907 


Hs.76494 


proline arginine-rich end ieutine-rich repeat protein 


102394 


U41766 


NM_003816 


Hs.2442 


a dislntegrin and metalloprotelnase domain 9 (meitrin gamma) 


129829 


U41B13 


AF010258 


Hs.127428 


homeoboxA9 


102251 


U41815 


NWL004398 


Hs.41706 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 10 (RNA helicase) 


102409 


U43266 


BE300330 


Hs.1 18725 


selenophosphate synthetase 2 


133746 


U44378 


AW410035 


Hs.75862 


MAD (mothers against decapentaplegic, Drosophila) homolog 4 


102423 


U44754 


Z47542 


Hs.179312 


small nuclear RNA activating complex, polypeptide 1 , 43kD 


132828 


U47011_cds1 


AB014615 


Hs.57710 


fibroblast growth factor 8 (and rogen-induced) 


130441 


U47077 


U63630 


Hs.155637 


protein kinase, DNA-acSvated, catalytic polypeptide 


102450 


U48251 


U48251 


Hs.75871 


protein kinase C binding protein 1 


129350 


U50535 


U50535 


Hs.1 10630 


Human BRCA2 region, mRNA sequence CG006 


102534 


U56833 


U96759 


Hs.198307 


von Hippel-Lindau binding protein 1 


130457 


U58091 


AB014595 


Hs.155976 


culIin4B 


135065 


U58837 


AA019401 


Hs.93909 


cyclic nucleotide gated channel beta 1 


102560 


U59289 


R97457 


Hs.63984 


cadherin 1 3, H-cadherin (heart) 


102557 


U59863 


U63830 


Hs.146847 


TRAF family member-associated NFKB activator 


134305 


U67122 


U61397 


Hs.81424 . 


ubiquitin-Iike 1 (sentrin) 


102638 


U67319 


U67319 


Hs.9216 


caspase 7, apoptosis-related cysteine protease 


132736 


U68019 


AW081883 


Hs.288261 


Homo sapiens cDNA: FU23037 fis, done LNG02036, highly similar to HSU68019 Homo 


sapiens mad protein homolog (hMAD-3) mRNA 


a disintegrin and metalloproteinase domain 17 (tumor necrosis factor, alpha, converting enzyme) 


133070 


U69611 


U92649 


Hs.64311 


102663 


U70322 


NNL002270 


Hs.168075 


karyopherin (importin) beta 2 


134660 


U73524 


U73524 


Hs.87465 


ATP/GTP-binding protein 


102735 


U79267 


AF111108 


Hs.3382 


protein phosphatase 4, regulatory subunit 1 


102741 


U79291 


AW959829- 


Hs.83572 


hypothetical protein MGC14433 


101175 


U82671 cds2 


U82671 


Hs.36980 


melanoma antigen, family A, 2 


132164 


U84573 


AI752235 


Hs.41270 


procollagen-lysine, 2-oxoglutarate 5-dioxygenase (lysine hydroxylase) 2 


102823 


U90914 


D85390 


Hs.5057 


carboxypeptidaseO 


102826 


U91316 


NM.007274 


Hs.8679 


cytosolic acyi coenzyme A thioester hydrolase 


102831 


U91932 


AA262170 


Hs.80917 


adaptor-related protein complex 3, sigma 1 subunit 


102846 


U96131 


BE264974 


Hs.6566 


thyroid hormone receptor interactor 13 


129777 


U97018 


U97018 •. 


Hs.12451 


echinodermmicrotubule-associated proteln-fike 


134161 


U97188 


AA634543 


Hs.79440 


IGF-ll mRNA-binding protein 3 


134854 


V00503 


J03464 


Hs.179573 


collagen, type I, alpha 2 


302363 


X04327 


AW163799 


Hs.198365 


2,3-bisphosphoglycerate mutase 


133708 


X06389 


AI018666 


Hs.75667 


synaptophysin 


125701 


X07496 


T72104 


Hs.93194 


apolipoprotein A-l 


102915 


X07820 


X07820 ' 


Hs.2258 


matrix metalloproteinase 10 (stromelysin 2) 


134656 


X14787 


AI750878 


Hs.87409 


thrombospondin 1 


413858 


X15525_ma1 


NWL001610 


Hs.75589 


acid phosphatase 2, lysosomal 


102958 


X16396 


AU076611 


Hs.154672 


methylene tetrahydrofoiate dehydrogenase (NAD+ dependent), methenyltetrahydrofolate 


cydohydrolase 






ankyrin 1, erythrocytic 


102971 


X16609 


X16609 


Hs.183805 


134037 


X53586 ma1 


AI8087B0 


Hs.227730 


integrin, alpha 6 


103023 


X53793 


AW500470 


Hs.1 17950 


multifunctional polypeptide similar to SAICAR synthetase and AIR carboxylase 


103037 


X54936 


BE018302 


Hs.2894 


placenta! growth factor, vascular endothelial growth factor-related protein 


130282 


X55740 


BE245380 


Hs.153952 


5' nucleotidase (CD73) 


134542 


X57025 


M14156 


Hs.85112 


tnsulin-ifce growth factor 1 (somatomedin C) 


128568 


X60573_ma1 


H12912 


Hs.274691 


adenylate kinase 3 


103093 


X60708 


S79876 


Hs.44926 


oipeptioyipepuQase iv iiaizo, auenosme oBarninase compieAing prutein c) 


133606 


X62048 


U10564 


Hs.75188 


wee1+ (S. pombe) homolog 


129063 


X63097 


X63094 


Hs.283822 


Rhesus blood group, D antigen 


424460 


X63563 


BE275979 


Hs.296014 


polymerase (RNA) IJ (DNA directed) polypeptide B (140kD) 


133227 


X64037 


AW977263 


Hs.68257 


general transcription factor HF, polypeptide 1 (74kD subunit) 


103181 


X69636 


X69636 


Hs.334731 


Homo sapiens, done IMAGE:3448306, mRNA. partial cds 


103184 


X69878 


U43143 


Hs.74049 


ftns-felated tyrosine kinase 4 


103194 


X70649 


N^004939 


Hs.78580 


DEAD/H (AsivGlu-Ala-Asp/His) box polypeptide 1 
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103208 


X72841 


AW411340 


Hs.31314 


129698 


X74987 


BE242144 


Hs.12013 


131486 


X83107 


F06972 


Hs.27372 


130729 


X84194 


AI963747 


Hs.18573 


103334 


X85753 


NMJ01260 


Hs.25283 


132645 


X87870 


AI654712 


Hs.54424 


135094 


X89066 


NM_003304 


Hs.250687 


103352 


X89398_cds2 


H09366 


Hs.78853 


103353 


X89399 


X89399 


Hs.1 19274 


132173 


X89426 


X89426 


Hs.41716 


103371 


X91247 


X91247 


Hs,13046 


131584 


X91648 


AA598509 


Hs.29117 


103376 


X92098 


AL036166 


Hs.323378 


103378 


X92110 


AL1 19890 


Hs.153618 


128510 


X94703 


X94703 


Hs.296371 


103410 


X96506 


AA158294 


Hs.334879 


133490 


X97230J 


AF022044 . 


Hs.274601 


103438 


X98263 


AW175781 


Hs.152720 


103440 


X98296 


X98295 


Hs.77578 


103452 


X99584 


NM 006936 


Hs.85119 


133536 


Y00264 


W25797.comp Hs.177486 


135185 


Y07566 


AW404908 


Hs.96038 


118523 


Y07759 


Y07759 


Hs,170157 


134662 


Y07827 


NM.007048 


Hs.284283 


132083 


Y07867 


BE386490 


Hs.279663 


103500 


Y09443~ 


AW408009 


Hs.22580 


134389 


Y09858 


Y09858 


Hs.82577 


132084 


Y12394 


NMJJ02267 


Hs.3886 


103540 


Z11559 


NIVL002197 


Hs.154721 


133152 


Z11695 


Z11695 


Hs.324473 


103548 


Z15005 


Z15005 


Hs.75573 


103612 


Z46261 


BE336654 


Hs.70937 


129092 


AA011243_s 


D56365 


Hs.63525 


103692 


AA018418 


AW137912 


Hs.227583 


(CACNA1F) gene, complete cds; HSP27 pseudogene, 


103695 AA018758 


AW207152 


Hs.186600 



129796 AA018804 
132258 AA031993 
132683 AA044217 
131887 AA046548 
member 1 

103723 AA057447_s 
453368 AA058376 
133260 AA083572 

103765 AA085696 

103766 AA088744 

103767 AA089688 
132051 AA091284 
103773 AA092700 
[C.elegans) 
135289 AA092968 
132729 AA094800 
-103794 AA100219 
131471 AA114885 
134319 AA129547 
103807 AA133016 
119159 AA149507 
129863 AA151005 
103850 AA187101 
103855 M195179_s 
322026 AA203138 
135300 AA203645 
103861 AA206236 
130634 AA227621 
[C.eiegans] 
447735 AA248283 
103909 AA249611 
131236 AA282640 
134060 AA287199 
129013 AA313990 
129435 AA314256 
103988 AA314389 
104000 AA324364 
425284 AA329211 s 
128629 AA399187 
133281 AA421079 



BE218319 
AA306325 
BE264633 
W17064 

BE274312 

W20296 

AA403045 

AA085696 

AI920783 

BE244667 

AA393968 

AI219323 

AW372569 

AW970843 

AF244135 

AA164842 

BE304999 

AW958264 

AF142419 

BE379765 

AA187101 

W02363 

AW024973 

AA142922 

AA205236 

AI769067 

AA775268 

AA249611 

AF043117 

D42039 

AA371156 

AF151852 

AA314389 

AH46527 

AF155568 

AL096748 

AK001601 



Hs.5807 
Hs.4311 
Hs.143638 
Hs.332848 

Hs.214783 

Hs.288178 

Hs.6806 

Hs.169600 

Hs.191435 

Hs.296155 

Hs.180145 

Hs.101077 

Hs.9788 

Hs.55682 

Hs.30670 

H&192619 

Hs.75653 

Hs.103832 

Hs.15020 

Hs.129872 

Hs.213194 

Hs.302267 

Hs.283675 

Hs.278626 

Hs.4944 

Hs.127824 

Hs.6127 

Hs,47438 

Hs.24594 

Hs.78871 

Hs.107942 

Hs.111449 

Hs.42500 

Hs.80475 

Hs.155489 

Hs.102708 

Hs.69594 



retinoblastoma-binding protein 7 
ATP-binding cassette, sub-family E (GABP), member 1 
BMX non-receptor tyrosine kinase 
acylphosphatase 1 , erythrocyte (common) type 
cyclin-dependent kinase 8 
hepatocyte nuclear factor 4, alpha 
transient receptor potential channel 1 
uracfl-DNAgtyoosylase 
RASp21[ 

endothelial cell-specific molecule 1 
thioredoxin reductase 1 
purine-rich element binding protein A 
coated vesicle membrane protein 
HCGVIIU protein 

RAB28, member RAS oncogene family 

DR1-associated protein 1 (negative cofactor 2 alpha) 

killer cell immunoglobulin-like receptor, three domains, long cytoplasmic tail, 1 

M-phase phosphoproteln 6 

ubiquitin specific protease 9, X chromosome (Drosophila fat facets related) 

SMT3 (suppressor of mlf two 3, yeast) homolog 1 

amyloid beta (A4) precursor protein (protease nexin-ll, Alzheimer disease) 

Ric (Drosophila)-iike, expressed in many tissues 

myosin VA (heavy polypeptide 12, myoxin) 

butyrophilin, subfamily 3, member A1 . 

Pirin 

alkylglycerone phosphate synthase 
splndlin-like 

karyopherin alpha 3 (importin alpha 4) 
aconilase 1, soluble 
mitogen-activated protein kinase 1 
centromere protein E (312kD) 
H3 histone family, member A 
poly(rC)-btnding protein 2 

Homo sapiens chromosome X map Xp1 1 .23 L-type calcium channel a(pha-1 subun'rt 
implete sequence; and JM1 proteini JM2 protein, and Hb2E genes, complete cds 
ESTs 

GTPase Rab14 

SUMO-1 activating enzyme subunlt 2 
WD repeat domain 4 

SWl/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, 

Homo sapiens cDNA FU14041 fis, clone HEMBA1005780 
Homo sapiens cDNA FLJ11968 fis, clone HEMBB1001133 
Homo sapiens cDNA: FU23197 fis, clone REC00917 
KIAA0826 protein 
ESTs 

CGI-100 protein 
HSPC030 protein 

ESTs, Weakly similar to T22363 hypothetical protein F47G9.4 - Caenorhabditis elegans 

hypothetical protein MGC10924 similar to Nedd4 WW-binding protein 5 

eukaryotic translation initiation factor 3, subunit 7 (zeta, 66/67kD) 

hepatocellular carcinoma-associated antigen 66 

KIAA1600 protein 

fumarate hydratase 

similar to yeast Upf3, variant B 

homolog of mouse quaking QKI (KH domain RNA binding protein) 
sperm associated antigen 9 
hypothetical protein MGC1 0895 
hypothetical protein FU10330 
NPD009 protein 

Arg/Abl-lnteracting protein ArgBP2 
hypothetical protein FU12783 

ESTs, Weakly similar to T28770 hypothetical protein W03D2.1 - Caenorhabditis elegans 

Homo sapiens cDNA: FU23020 fis, clone LNG00943 
SH3 domain binding glutamic acid-rich protein 
ubiquitination factor E4B (homologous to yeast UFD2) 
mesoderm development candidate 2 
DKFZP564M1 12 protein 
CGI-94 protein 
ADP-ribosylaQon factor-like 5 

polymerase (RNA) II (DNA directed) polypeptide J (13.3kD) 
NS1 -associated protein 1 
DKFZP434A043 protein 
high-mobility group 20A 
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104104 AA422029 
[H.sapiens] 
108154 AA425230 
132091 AA447052 
135073 AA452000 
131367 AA456687 
129593 AA487015_S 
135266 AB002326 
133505 C01527 
132064 C01714 
134393 C01811J 
131427 C0235^s 
133435 C02375 
104282 C14448 
134827 D1661U 
130443 D25216 
131742 D31352 
132837 D58024_s 
130377 D80897 
104334 D82614 
134593 D87845 
134731 D89377J 
129913 H06583 
131670 H40732 
104394 H46617 
104402 H56731 
129781 H75570 
129077 H78886 
104417 H81241 
134927 L36531 
129280 M63154 
134498 M63180 
104460 M91504 
104488 N56191 
131248 N78483 

129214 N79268 
130017 R14652 
104530 R20459 
104534 R22303 
sequence. 
104544 R33779 
133328 R36553 
104567 R64534 
128562 R66475 
129575 R70621 
130776 R79356 
104599 R84933 
104660 RCJ\A007160 
104667 RC_AA007234_s AI239923 
10471B RCLAA018409 AI143020 
104764 RC_AA025351 AI039243 

104786 RCJWJ27168 AA027167 

104787 RC_M027317 AA027317 
similar to contains AIu repetitive element. 
134079 RC_AA029423 
104804 RCJ\A031357 
104865 RC_AA045136 
130828 RCJ\A053400 
104907 RC_AA055829 
WARNING ENTRY [H.sap 
104943 RCLAA065217 
105013 RC^AA116054 
105024 RCJVA126311 
132592 RCJWI29390 
105038 RC_M130273 
105077 RCJ\A142919 
105096 RC^AA150205 

129215 RCJW76867 
105169 RC_AA180321 
132796 RCJ\A180487 
130401 RC.M187634 
105200 RCJW95399 
130114 RCJ\A234717 
105330 RCJ\A234743 
105337 RCJ\A234957 
129385 RCJ\A235604 



AA422029 Hs.143640 ESTs, Weakly similar to hyperpdarization-activated cyclic nudeotide-gated channel HHCN2 



NM.005754 

AW954243 

W55956 

AI750575 

AI338247 

R41179 

AI630124 

AA121098 

W52642 

AF151879 

AI929357 

C14448 

BE314037 

D25216 

AA961420 

AA370362 

NM.014909 

D82614 

-NNL000437 

D89377 

NM.001310 

H03514 

AA129551 

H56731 

AA306090 

N74724 

AI819448 

L36531 

M63154 

AW246273 

AW955705 

N56191 

AI038989 

AL044335 

AKD00096 

AK001676 

R22303 

A1091173 

AW452738 

AA040620 

AA923382 

F08282 

AF167706 

AW815036 

BE298665 



Hs.220689 

Hs.170218 

Hs.94030 

Hs.173933 

Hs.98314 

Hs.97393 

Hs.324504 

Hs.3838 

Hs.8261 

Hs.26706 



Ras-GTPase-activating protein SH3-domain-binding protein 
KIAA0251 protein 

Homo sapiens mRNA; cONA DKFZp586E1624 {from clone DKFZp586E1624) 
nuclear factor l/A 

Homo sapiens mRNA; cONA DKFZp586L0120 {from clone DKF2p586L0120) 



Hs.332338 



Hs.155650 

Hs.31433 

Hs.57958 

Hs.155162 

Hs.78771 

Hs234392 

Hs.89404 

Hs.13313 

Hs.10130 

Hs.172129 

Hs.132956 

Hs.124707 

Hs.108479 

Hs.320861 

Hs.91296 

Hs.110014 

Hs.84131 

Hs.62604 

Hs.106511 

Hs.332633 

Hs.109526 

Hs.143198 

Hs.12457 



Homo sapiens mRNA; cDNA DKFZp586J0720 (from clone DKF2p586J0720) 
serum-tnducible kinase 
hypothetical protein FU22393 
CGI-121 protein 

Homo sapiens clone H63 unknown mRNA 
EST 

coproporphyrinogen oxidase (coproporphyria, harderoporphyria) 

KIAA0014 gene product 

ESTs 

EGF-TM7-latrophiiin-related protein 

KIAA1036 protein 

phosphogrycerate kinase 1 

platelet-activating factor acefylhydrolase 2 (40kD) 

msh (Drosophila) homeo box homolog 2 

cAMP responsive element binding protein-fike 2 

ESTs 

Homo sapiens cONA: FU21409 fis. clone COL03924 

ESTs 

ESTs 

ESTs 

Kruppel-like factor 8 
integral, alpha 8 

gastric intrinsic factor (vitamin B synthesis) 



Homo sapiens, clone IMAGE:4299322, mRNA, partial cds 

protocadherin 17 

Bardet-Biedl syndrome 2 

zinc finger protein 198 

inhibitor of g rowth family, member 3 

hypothetical protein FU10814 

gb:yh26b09.r1 Soares placenta Nb2HP Homo sapiens cDNA clone IMAGE:130841 5*. mRNA 



AK001751 


Hs.171835 


AI858702 


Hs.31803 


T79340 


Hs22575 


AW631469 


Hs203213 


AA055829 


Hs.195701 


3ns] 

AF072873 


Hs.1 14218 


H63789 


Hs.296288 


AA126311 


Hs.9879 • 


AW803564 


Hs.288850 


AW503733 


Hs.9414 


W55946 


HS234863 


AL0425Q6 


Hs.21599 


AB040930 


Hs.126085 


BE245294 


Hs.180789 


NWL006283 


Hs.173159 


BE396283 


Hs.173987 


AA32B102 


Hs.24641 


AA233393 


Hs.14992 


AW338625 


HS22120 


AI468789 


Hs.23200 


AA172106 


Hs.110950 



Hs222362 ESTs, Weakly similar to p40 [H.sapiens] 
Hs265327 hypothetical protein DKFZp761 1141 
Hs.5672 hypothetical protein AF140225 
Hs.101490 ESTs 
Hs.278428 progestin induced protein 
Hs.19280 cysteine-rich motor neuron 1 
Hs.151251 ESTs 

Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (from clone DKFZp564D016) 
Hs.30098 ESTs 

Hs.36250 ESTs, Weakly similar to I38022 hypothetical protein [Ksapiens] 
Hs278585 ESTs 
Hs.10031 KIAA0955 protein 

gb:ze97d1 1.s1 Soares JetaLhearLNbHH19W Homo sapiens cDNA clone IMAGE:366933 3" 
mRNA sequence. 

hypothetical protein FU10889 
ESTs, Weakly similar to N-WASP [Rsaptens] 
B-cell CLL/lymphoma 6, member B (zinc finger protein) 
ESTs 

ESTs, Weakly similar to ALU1JWMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

frizzled (Drosophila) homolog 6 
ESTs, Weakly similar to K1AA0638 protein [H .sapiens] 
ESTs 

Homo sapiens cDNA: FU22528 fis, clone HRC12825 
WAA1488 protein 

Homo sapiens cDNA FU12082 fis. clone HEMBB1002492^ 
KruppeUike factor 7 (ubiquitous) 
KIAA1497 protein 
S164 protein 

transforming, acidic coiled-coil containing protein 1 
eukaryotic translation initiation factor 3, subunit 1 (alpha, 35kD) 
cytoskeleton associated protein 2 
hypothetical protein FU1 1 151 
ESTs 

myotubularin related protein 1 
Rag C protein 
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105376 RCJ\A236559 AW994032 

105397 RCJW242858 AA814807 

131982 RCJW251776 AK00004B 

131991 RC_AA251909 AF053306 
5 128658 RC_AA252672_s BE397354 

105489 RCJA256157 AA256157 

105508 RC AA256880 AA173942 

105539 RCLAA258873 AB040884 

135172 RC AA262727 
10 131569 RCJVA281451 

132542 RC.AA281545 

105643 RC^AA282069 

105659 RC_AA283044 

105666 RCLAA283930 
15 105674 RCJ\A284755 

105709 RC_AA291268 

105722 RC AA291927 

105765 RC_AA343514 

115951 RCJ\A398109 
20 105962 RCJM05737 

105985 RCJ^A406610 

gb:X02067 

106008 RC_AA411465 

131216 RC.AA416BB6 
25 134222 RC.AA424013 

113689 RC.AA424148 

106141 RCJM24558 

130839 RCJM24961_s 

106157 RC^AA425367 
30 130777 RCJ\A425921 

receptor 

130561 RCJW26220 

106196 RC.AA427735 

WARNING 
35 131878 RC.AA430673 

133200 RC.AA432248 

106302 RC AA435896 

106328 RClAA436705 

450534 RC.AA446561 
40 106423 RC.AA448238 

133442 RCJVA448688 ' 

439608 RCJ\A449756 

106477 RC.AA450303 

106503 RC^AA452411 
45 446999 RC_AA454566 

106543 RCJVA454667 

130010 RC.AA456437 

106589 RC J AA456646 

106593 RC_AA456826 
50 106596 RC_AA456981 

CONTAMINATION . 

134655 RC_AA458959 
• member 1 

106636 RCJWJ59950 
55 106654 RC.M460449 

131353 RCJUV463910 

106707 RC_jAA464603 

131710 RCJ^A464606 

106717 RC.AA465093 
60 131775 RC J AA465692 

106747 RCJVA476473 

106773 RC_AA478109 

106781 RCJW478474 

106817 RC_AA480889 
65 106846 RC_AA485223 

106848 RC_AA485254 

106856 RCJ\A486183 

418699 RC_AA496936 

WARNING 

70 107001 RCJVA598589 AI926520 
130638 RCJVA59&831J AW021276 
107054 RCJVA600150 AI076459 
107059 RCJU608545 BE614410 
107080 RC.AA609210 AL122043 

75 107115 RC_AA610108 BE379623 
107130 RC^AA620582 AB033106 



AL389951 

AL137751 

BE621719 

AA283044 

AA426234 

AI609530 

AI92B962 

AI922821 

AA299688 

BE546245 

AW880358 

AA406610 

AB033886 

AI815486 

AW855861 

AB037850 ' 

AF031463 

AB011169 

W37943 

AW135049 

AB011095 
AA525993 

AA083764 

AB037715 

AA398859 

AL079559 

AI570189 

AB020722 

AL137663 

AW864696 

R23324 

AB033042 

AA151520 

AA676939 

AA301116 

AK000933 

AW296451 

AA452379 



Hs.8768 

Hs.7395 

Hs.267448 

Hs.36708 

Hs.324830 

Hs.24115 

Hs.326416 

Hs.109694 

Hs.12144 

Hs.271623 

Hs.263671 

Hs.173802 

Hs.25625 

Hs.34906 

Hs.279789 

Hs.26761 

Hs.32433 

Hs.24183 

Hs.301048 

Hs.339808 



Hs.8619 

Hs.243901 

Hs.8025 

Hs.16621 

Hs.9302 

Hs.20141 

Hs.34892 

Hs.265418 

Hs.16032 
Hs.173699 

Hs.6101 

Hs.183639 

Hs.18397 

Hs.28020 

Hs.25132 

Hs.16714 

Hs.7378 

Hs.301732 

Hs.41693 

Hs.29679 

Hs.334822 

Hs.69285 

Hs.142838 

Hs-28661 '. 

Hs.24605 

Hs.293552. 



hypofftetical protein RJ10849 
hypothetical protein RJ23182 
hypothetical protein FU20039 

budding uninhibited by benzimidazoles 1 (yeast homolog), beta 

diptheria toxin resistance protein required for diphthamlde biosynthesis (SaccharomycesHike 2 

Homo sapiens cDNA FU14178 fls, clone NT2RP2003339 

Homo sapiens mRNA; cDNA DKFZp564H1916 (from done DKFZp564H1916) 

KIAA1451 protein 

K1M1033 protein 

nucieoporin 50kD 

Homo sapiens mRNA; cONA DKF2p434J0812 (from done DKFZp434l0812); partial cds 
KIAA0603 gene product 
hypothetical protein FU11323 

ESTs, Weakly similar to T17210 hypothetical protein DKFZp434N041.1 [H.sapiensl 

histone deacetytase 3 

DKF2P5B6L0724 protein 

ESTs 

ESTs 

seel 3-like protein 
hypothetical protein FU10120 

gb:zv15b10.s1 Soares_NhHMPu_S1 Homo sapiens cDNA clone IMAGE:753691 3' similar to 

SRY (sex determining region Y)-box 18 
Homo sapiens cDNA FU20738 fa, clone HEP08257 
Homo sapiens clone 23767 and 23782 mRNA sequences 
DKFZP434I116 protein 



similar to S. cerevisiae SSM4 
KIAA1323 protein 

Homo sapiens cDNA FU10643 fis, clone NT2RP2005753, highly similar to Homo sapiens M 
KIAA0523 protein 

ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

hypothetical protein MGC3178 
hypothetical protein FU10210 
hypothetical protein FU23221 
KIAA0766 gene product 
KIAA0470 gene product 
Rho guanine exchange factor (GEF) 1 5 

Homo sapiens mRNA; cDNA DKFZp434G227 (from clone DKFZp434G227) 

hypothetical protein MGC5306 

DnaJ (Hsp40) homolog. subfamQy B, member 4 

cofactor required for Sp1 transcriptional activation, subunit 3 (130kD) 

hypothetical protein MGC4485 . 

neuropilinl 

nucleolar phosphoprotein Nopp34 

Homo sapiens cDNA FLJ10071 fc, clone HEMBA1001702 

ESTs 

ESTs, Moderately similar to ALU7 HUMAN ALU SUBFAMILY SQ SEQUENCE 



AF265208 Hs.123090 SWI/SNF related,' matrix associated, actin dependent regulator of chromatin, subfamily f, 



AW956037 

AW075485 

AW754182 

AK000566 

NMJM5368 

AA600357 

AB014548 

NM_007118 

AA478109 

AA330310 

D61216 

AB037744 

AA449014 

W58353 



Hs.286 • ribosomalproteinL4 

Hs.286049 phosphoserine aminotransferase 

gb:RC2-CT032M31199-011.c01 CT0321 Homo sapiens cDNA, mRNA sequence 

Hs.98135 hypothetical protein FU20559 

Hs-30985 pannexm 1 

Hs.239489 TIA1 cytotoxic granule-associated RNA-binding protein 

Hs.31921 K1AA0648 protein 

Hs.171957 triple functional domain (PTPRF interacting) 

Hs.188833 ESTs 

Hs.24181 ESTs 

Hs.18672 ESTs 

Hs.34892 KIAA1323 protein 

Hs. 1 21 025 chromosome 1 1 open reading frame 5 

Hs.285123 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2005779 

Hs.173030 ESTs, Weakly similar to ALUS JHUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 

Hs.31016 putative DNA binding protein 

Hs.17121 ESTs 

Hs.15978 K1AA1272 protein 

Hs.23044 RAD51 (S. cerevisiae) homoibg (E coli RecA homotog) 

Hs.19221 hypothetical protein DKFZp566G1424 

Hs.27693 peptidylprolyl bomerase (cydophQin)-fik0 1 

Hs.12913 K1AA1280 protein 
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107156 RCLAA621239 AA137043 
107174 RCJVV621714 BE122762 
130621 RC_AA621718 AW513087 
107190 RC D19673 AA836401 
5 132626 RC_D25755js AW504732 
107217 RC.D51095 AL080235 
131610 RCJ36027£j AA357879 
' 129604 T08879 AF088886 
107295 T34527 AA186629 
10 (Ga!NAc-T1) 

107299 T40327_s 

107315 T62771.S 

107316 T63174_s * 
107328 T83444 

15 107334 T93641 

134715 U48263 

128636 U49065 

129938 U79300 
. 107375 U88573 
20 130074 U93867 

107387 W01094 

132036 W01568 

107426 W26853 

113857 W27179 
25 135388 W27965 

130419 W36280_s 

107469 W47063 

132616 W79060 

107506 W88550 
30 132358 X60486 

107522 X78931.S 

125827 214077.S 

107582 RC_MQ02147 

107609 RC.AA004711 
35 107661 RCLAA010383 

107714 RCJWJ15761 

107775 R(LAA018772 

107832 RC_AA021473j- AA021473 

sequence. 
40 107859 RCJWJ24835 

124337 RC_AA025858 

107914 RC_AA027229 

[C.elegans] 

107935 RCJ\A029428 
45 116262 RC AA035143 

131461 RCLAA035237 

108007 RCJ\A039347 

108029 RC^AA040740 

108040 RC.AA041551 
50 member 1 

108084 RC.AA045513 

108088 RC_AA045745 

'108168 RC AA055348 

130719 RCj*A056582_s AA679262 
55 108189 RCJ\A056697 AW376061 

108190 RCJ^A056746 

108203 RCJ\A057678 

108216 RCJM058681 

108217 RC.AA058686 
60 108245 RC.M062840 

108277 RC_AA064859 
mRNA 

108280 RC.AA065069 
.. 108309 RCJVA069923 
65 133739 RC_AA070799_S BE535554 

108340 RCj4A070815 AA069820 

108403 RC J AA075374 
• 3*. mRNA sequence. 

108427 RC.AA076382 
70 3', mRNA sequence. 

108435 RCJ\A078787 

108439 RC.AA078986 

3', mRNA sequence. 

108465 RCJ^A079393 AA079393 
75 108469 RCJW079487 AA079487 

sequence 



BE277457 

AA316241 

T63174 

AW959891 

T93597 

U48263 

U49065 

AW003668 

BE011845 

AL038596 

D85983 

AL1 57433 

W26853 

AW243158 

W27965 

AF037448 

W47063 

BE262677 

AB028981 

NIVL003542 

X78931 

NM003403 

AA002147 

R75654 

AA010383 

AA015761 

AW008846 



AW732573 

N23541 

M027229 

AA029428 
AI936442 
AA992841 
AA039347 
AA040740 
AL121031 

AA058944 
AA045745 
AI453137 



AA056746 
AW847814 
AA524743 
AA058686 
BE410285 
AA064859 

AA065069 
AA069818 



AA075374 
AA076382 
T82427 



Hs.9663 

Hs.25338 

Hs.16803 

Hs.5103 

Hs.21275 

Hs.35861 

Hs.29423 

Hs.11590 

Hs.80120 

Hs.30661 



programmed ceO death 6-interacfing protein 
ESTs 

LUC7 (S. cerevtsiaeHike 
ESTs 

hypothetical protein RJ11011 
DKFZP586E1621 protein 
scavenger receptor with C-type lectin 
cathepsln F 



Hs.193700 

Hs.76591 

Hs.187429 

Hs.89040 

Hs.102865 

Hs.135587 

Hs.251064 

Hs.250745 

Hs.1 18893 

Hs.37706 

Hs.291003 

Hs.5297 

Hs.99865 

Hs.155489 

Hs.94668 

Hs.283558 

Hs.8021 

Hs.46423 

Hs.99971 

Hs.97496 

Hs.59952 

Hs.164797 

Hs.60389 

Hs.60642 

Hs.60857 



Hs.47584 
Hs.281561 
Hs.61329 

Hs.61555 
Hs.59838 
Hs.27263 
Hs.61916 
Hs.62007 
Hs.159971 

Hs.116602 

Hs.62886 

Hs.63176 

Hs.14235 

Hs.63335 

Hs.63338 

Hs.289005 

Hs.44883 

Hs.62588 

Hs.89545 



Hs.278270 
Hs.180909 



Hs.194101 
Hs.3462 



hypothetical protein MGC4606 
nudeophosmin/nudeoplasmln 3 

Homo sapiens mRNA; cDNA DKF2p586l0324 (from done DKFZp586l0324) 

KIAA0887 protein 

ESTs 

prepranociceptin 

interteukin 1 receptor-like 2 

Human clone 23629 mRNA sequence 

high-mobility group (nonhistone chromosomal) protein 14 

polymerase (RNA) Hi (DNA directed) (62kD) 

Melanoma associated gene 

hypothetical protein DKFZp434E2220 

hypothetical protein MGC4707 

DKFZP564A2416 protein 

epimorphin 

NS1-associated protein 1 
ESTs 

hypothetical protein PR01855 
KIAA1058 protein 
H4 histone family, member G 
zinc finger protein 272 
YY1 transcription factor 
. EST 

hypothetical protein FU13693 

ESTs 

ESTs 

ESTs 

gb:ze66c11.s1 Soares retina N2b4HR Homo sapiens cDNA clone IMAGE363956 3\ mRNA 

potassium voltage-gated channel, delayed-rectffier, subfamily S, member 3 
Homo sapiens cDNA: FU23582 fis, done LNG13759 

ESTs, Weakly similar to T16370 hypothetical protein F45E12.5 - Caenorhabditis elegans 
ESTs 

hypothetical protein FU10808 

KIAA1458 protein 

EST 

ESTs. 

SWI/SNF related, matrix associated, adin dependent regulator of chromatin, subfamily b, 



Homo sapiens, done IMAGE:41 54008, mRNA, ; 
ESTs 
ESTs 

hypothetical protein FLJ20008; KIAA1839 protein 

ESTs. Moderately similar to A46010 X-Iinked retinopathy protein [H.sapiens] 

EST 

Homo sapiens cDNA: FU21532 fis, done COL06049 

ESTs 

ESTs 

proteasome (prosome, macropain) subunit, beta type, 4 

gb:zm50f03.s1 Stratagene fibroblast (937212) Homo sapiens cDNA done IMAGE:529085 3', 

gb:zm12e1 1js1 Stratagene pancreas (937208) Homo sapiens cDNA done 3', mRNA sequence 
gb:zm67e03.r1 Stratagene neuroepithelium (937231) Homo sapiens cDNA clone 5* similar to 
unactive progesterone receptor, 23 kD 
peraxirodoxin 1 

gb:zm87a01.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done IMAGE:544872 
gb:zm91g08.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done IMAGB545342 
Homo sapiens cDNA: FU20869 fis, clone ADKA02377 

gb:zm92h01.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done IMAGE:545425 
cytochrome c oxidase subunit Vile 

gb:zm97fl)8.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA done 3', mRNA 
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108500 RCJVA063207 AA083207 

108501 RCJWJ83256 AA083256 
gb:M33308 

108533 RCJW084415 AA0B4415 
5 mRNA 

' 108562 RCJW085274 AA100796 
gb^C15341 

108589 RC.M088678 AI732404 

130890 RCJ\A100925 AI907537 

10 134585 RCJW01255 D14041 

130385 RCJW26474 AW067800 

108749 RCJVA127017 AA127017 

. 108807 RC.M129968 AI652236 

108808 RCJW130240 AA045088 

15 108833 RCJVA131866 AF188527 

107290 RC AA132039 W27740 

108846 RClAA132983 AL1 17452 

108857 RCJVA133250 AK001468 

131474 RCJW33583 3 L46353 

20 108894 RCJW35941 AK001431 

108941 RCJU148650 AA148650 
IMAGE:567202 3\ 

108968 RC_AA151110 AI304870 

108996 RC_AA155754 AW995610 

25 109001 RCJW56125 AI056548 

131183 RCJW56289 AI611807 

109019 RCJW56997 AA156755 

109022 RCJW57291 AA157291 

109023 RC AA157293 AA157293 
30 1090S8 RC_AA164293J AA164293 

109072 RCJVU64676 AI7325B5 

129021 RCJ\A167375 AL044675 

130346 RC_AA167550 H05769 

. 109146 RCJ\A176589 AA176589 

35 109172 RCJW80448 AA180448 

131080 RC JW87144_s NNL001955 

129208 RC J\A189170_f AI587376 

109222 RC_M192757 AA192833 

109300 RCJW205650 AA418276 

40 109481 RCJVA233342 AA878923 

109485 RC JVA233472 BE619092 

109516 RC.AA234110 AI471639 

109537 RC D80981 A1858695 

109556 RC.F01660 A1925294 

.45 109577 RC F02206 . F02206 

109578 ,RC F02208 F02208 

109595 RC F02544 AA078629 

. 109625 RC F03918 H29490 

131983 RC.F04258.8 AF119865 

50 109648 RC.F04600 H17800 

109671 RC F08998 R59210 

• 109699 RC.F09605 ; H18013 

109820 RC_F11115 AW016809 

. 109933 RCJM6371 R52417 

55 110014 RC.H10995 AL109666 

110039 RC.H11938 H11938 

110099 RC_H16568 R44557 

110107 RCJH6772 AW151660 

110155 RC H18951 AI559626 

60 110197 RC_H20859 AW090386 

110223 RC_H23747 H19836 

110306 RCJH38087 H38087 

- 110335 RC.H40331 H65490 

110342 RC_H4Q567 H40961; . 

65 110395 RC.H46966 AA025116 

110511 RC.H56640J H56640 

110523 RC.H57154 AI040384 

110715 RCJH96712 H96712 

110754 RC.N20814 AW302200 

70 130132 RC.N25249 U55936 

131135 RCJJ2710O NM.016569 

134263 RCJI39616 AW973443 

110938 RC.N48982 N48982 

110983 RCJI51957 NNL015367 

75 115062 RCJJ52271 AA253314 

111081 RCJI59435 AI146349 



Hs.68270 EST 



gb2n08g12.s1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone 3' similar to 
gbzn06g09.s1 Straiagene hNT neuron (937233) Homo sapiens cONA clone IMAGE:546688 3\ 
gfczm26c06.s1 Stratagene pancreas (937208) Homo sapiens cDNA clone 3 1 similar to 



Hs.68846 ESTs 



. Hs.278573 
Hs.155223 
Hs.71052 
Hs.49376 
Hs.62736 
Hs.61661 
Hs.323760 
Hs.44155 
Hs.62180 
Hs.2726 
Hs.5105 



Hs.188680 
Hs.332436 
Hs.72116 
Hs.285107 
Hs.72150 
Hs.21479 
Hs.72168 
Hs.72545 
Hs.22394 
Hs.173081 
Hs.188757 
Hs.142078 
Hs.144300 
Hs.2271 
Hs.109441 
Hs.333512 
Hs.170142 
Hs.289069 
Hs.28465 
Hs.71913 
Hs.34898 
Hs.87385 
Hs.296639 
Hs.27214 
Hs.27301 
Hs.22697 
Hs.184011 
Hs.7154 
Hs.26634 
Hs.167483 
Hs.323795 
Hs.20945 
Hs.7242 
Hs.21907 
Hs.23748 
Hs.31444 
Hs.93522 
Hs,112278 
Hs.31697 
Hs.105509 
Hs.18845 
■ Hs.33008 
Hs.33333 
Hs.221460 
Hs.19102 
Hs.269029 
Hs.6336 
Hs.184376 
Hs.267182 
Hs.8086 
Hs.38034 
Hs.10267 
Hs.1 54103 
Hs.271614 



stress-associated endoplasmic reticulum protein 1; ribosome associated membrane protein 4 
H-2K binding factor-2 
stanniocalcin 2 
ESTs 

hypothetical protein FU20644 
ESTs 

ESTs, Weakly similar to AF1 74605 1 F-box protein Fbx25 [H.saplens] 
ESTs 

DKFZP586G1517 protein 

arMn (DrosophBa Scraps homolog), actin binding protein . 
high-mobility group (nonhistone chromosomal) protein isoform l-C 
hypothetical protein FU10569 

gb2o09e06.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens cDNA clone 

ESTs 
EST 

hypothetical protein FU20992 similar to hedgehog-interacting protein 
hypothetical protein FU13397 

ESTs ' 

ubinuclein 1 

ESTs 

ESTs 

hypothetical protein FU10893 
KIAA0530 protein 

Homo sapiens, clone MGC:5564, mRNA, complete cds 

EST 

EST 

endothelin 1 
MSTP033 protein 
similar to rat myomegalin 
ESTs 

hypothetical protein FLJ21016 

Homo sapiens cDNA: FU21869 fls, clone HEP02442 

ESTs 

ESTs 

ESTs 

Homo sapiens potassium channel subunit (HERG-3) mRNA, complete cds 

ESTs 

ESTs 

ESTs 

pyrophosphatase (inorganic) 

ESTs . 

ESTs- 

ESTs 

ESTs 

Homo sapiens done 24993 mRNA sequence 

Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 35907 

histone acetyftransferase 

ESTs 

ESTs 

Homo sapiens mRNA for KIAA1647 protein, partial cds 

arrestin, beta 1 

ESTs 

CTL2 gene 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to organic anion transporter 1 [H.saptens] 
ESTs 

KIAA0672 gene product 
synaptosomal-associated protein, 23kD 
TBX3-iso protein 

RNA (guanlne-7-) methyltransferase 

Homo sapiens cDNA FU 12924 fis, clone NT2RP2004709 

M1L1 protein 

LIM protein (similar to rat protein kinase C-blnding enigma) 
CGU 12 protein 
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111128 RC.N64139 
135244 RC N66381 
111216 RC.N68640 
437552 RC.N69352 
131002 RC_N95226 
111399 RC.R00138 
111514 RC_R07998 
similar to 

130182 RC.R08929 
111574 RC.R10307 
111804 RC.R33354 
111831 RC_R36083 
129675 RC.R37938J 
111904 RC_R39330 



133868 RC_R40816jb 
112033 RC__R4316^s 
130987 RC.R45698 
112300 RC_R54554 

112513 RC_R68425 

112514 RC_R68568 
112522 RC_R68763 
112540 RC_R70467 
130346 RC R73565 
129534' RC.R73640 
112597 RC_R78376 
112732 RCLR92453 
131458 RC.T03865 
112888 RCJD3872 
131863 RC.T10072 
112911 RC_T10080 
132215 ROJ10132 
112931 RC.T15343 

RCJT23457 
RCJT23555 
RC.T23670 
RCJT23946 
RC.T33464 
RC_T34413 
RC,T34611 
ROJ40920 
RCJT55182 
RCJT77453 
RC T84039 
RC T86458 
RC_T87693 
ROJ89350_s 
113557 RCJ90945 
113559 RCJT90987 
113589 RCJT91863 
113591 RC.T91881 
113619 RCJT93783.S . 
113683 RCJT96687 
113692 RCJ96944 
113702 RCJT97307 
mRNA 

113717 RC_T97764 
113824 RCJ/V48817 
113840 RCJV5B343 
113844 RCJ/V59949 
PROTEIN TC10 
113902 RCJN74644 

113904 RCJV74761 

113905 RCJV74802 

113931 RCJV81205 

113932 RCJA/81237 
131965 RCJV90146J 
114035 RCJV9Z798 
114106 RCJZ38412 
133593 RC^38709 
114161 RC_Z38904 
424949 RCJ39103 
129059 RC_Z39930J 
128937 RC^Z39939 
WARNING 

130983 RC.Z40012J 



112984 
112998 
133376 
113026 
113070 
128970 
113074 
113095 
113179 
113337 
113421 
113454 
113481 
131441 



AW505364 Hs.19074 UTS (large tumor suppressor, Drosophlla) homolog 2 

AI834273 Hs.971 1 novel protein 

AW139408 Hs.152940 ESTs 

AB001636 Hs.5683 DEAD/H (Asp-Glu-Aia-Asp/His) box polypeptide 1 5 

AL050295 Hs.22039 KIAA0758 protein 

AW270776 Hs.18857 ESTs 

R07998 gbyf16g11.s1 Soares fetal Rver spleen 1NFLS Homo sapiens cDNA done IMAGE:1 27076 3' 

BE267033 Hs.192853 ubiqultin-conjugating enzyme E2G 2 (homologous to yeast UBC7) 

AI024145 Hs.188526 ESTs 

AA482478 Hs.181785 ESTs. 

R36095 Hs.268695 ESTs 

NWL015556 Hs.172180 KIAA0440 protein 

Z41572 gb:HSCZYB122 normalized Infant brain cDNA Homo sapiens cDNA clone o-zyb12, mRNA 

AB012193 Hs.183874 cuHin4A 

R49031 Hs.22627 ESTs 

BE613269 Hs.21893 hypothetical protein DKFZp761N0624 

H24334 Hs.26125 ESTs 

R68425 Hs.13809 hypothetical protein FLJ10648 

R68568 Hs.183373 src homology 3 domain-containing protein HlP-55 

R68857 .. Hs.265499 ESTs 

R69751 gb:yi40a10.s1 Soares placenta Nb2HP Homo sapiens cDNA done 3', mRNA sequence 

H05769 Hs.1 88757 Homo sapiens, done MGC:5564, mRNA, complete cds 

AK002126 Hs.1 1260 hypothetical protein FLJ 11264 

R78376 Hs.29733 EST 

R92453 Hs.34590 ESTs 

BE297567 Hs.27047 hypothetical protein FLJ20392 

AW195317 Hs.107716 hypothetical protein FU22344 

AI656378 Hs.33461 ESTs 

AW732747 Hs.13493 like mouse brain protein E46 

AL035703 Hs.4236 KIAA0478 gene product 

T02966 Hs.167428 ESTs 

T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.saplens] 

H11257 Hs.22968 Homo sapiens done IMAGE:451939, mRNA sequence 

BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 

AA376654 Hs. 183684 eukaryotic translation initiation factor 4 gamma, 2 

AB032977 Hs.6298 KIAA1151 protein 

AI375672 Hs.165028 ESTs 

AK001335 Hs.31137 protein tyrosine phosphatase, receptortype, E 

AA828380 Hs.126733 ESTs 

BE622021 Hs.152571 ESTs, Highly similar to IGF-II mRNA-binding protein 2 [H.sapiens] 

T77453 Hs.302234 ESTs 

AI769400 Hs.189729 ESTs 

AI022166 Hs.16168 ESTs 

T87693 Hs.204327 EST 

AA302862 Hs.90063 neurocalcin delta 

H66470 Hs.16004 ESTs 

T79763 Hs.14514 ESTs 

A1078554 Hs.15682 ESTs. 

T91881 Hs.200597 KIAA0563 gene product 

R08665 Hs.17244 hypothetical protein FLJ 13605 . 

AB035335 Hs.144519 T-cell leukemia/iymphoma 6 

AL360143 Hs.17936 ■" DKFZP434H132 protein 

T97307 gb:ye53h05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE:121497 3*, 

T99513 Hs.187447 ESTs 

A1631964 Hs.34447 ESTs 

R72137 HsJ949 DKFZP586B2420 protein 

AI369275 Hs.243010 Homo sapiens cDNA FU14445 fis, done HEMBB1001294, highly similar to GTP-BINDING 

AA3401 1 1 Hs. 100009 acyl-Coenzyme A oxidase 1 , palmitoyl 

AF125044 Hs.19196 ubiquitin-conjugating enzyme HBUCE1 

R81733 Hs.33106 ESTs ' ■ 

BE255499 Hs.3496 hypothetical protein MGC1 5749 

AA256444 Hs.126485 hypothetical protein FU12604; K1AA1692 protein 

W79283 Hs.35962 ESTs 

W92798 Hs.269181 ESTs 

AW602528 gb:RC5-BT0562-260100-01 1 -A02 BT0562 Homo sapiens cDNA, mRNA sequence 

AI416988 Hs.238272 inositol 1 ,4,5-triphosphate receptor, type 2 

BE548222 Hs.299883 hypothetical protein FU23399 

AF052212 Hs.1 53934 core-binding factor, runt domain, alpha subunit 2; translocated to, 2 

AW069534 Hs.279583 CGI-81 protein 

AA251380 Hs.10726 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

AI479813 Hs.278411 NCK-assodated protein 1 
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114277 RC_Z40377_s AI052229 Hs.25373 
[C.elegans] 

114304 RC_£40820 AI934204 Hs.16129 

114364 RC_Z416B0 AL1 17427 Hs.172778 

5 132900 RCJW005112 AA777749 Hs.5978 

129034 RCJVA005432 AA481157 Hs.108110 

131881 RCLAA010163 AW361018 Hs.3383 

452461 RCLAA026356 N78223 Hs.108106 

114465 RCJM026901 BE621056 Hs.131731 

10 131376 RC.AA036867 AK001644 Hs.26156 

101567 RCJWJ44644 M33552 Hs.56729 

431555 RCJ\A046426 AI815470 Hs.260024 

132944 RCJVA054515 T98641 Hs.6127 

114618 RCJM084162 AW979261 Hs.291993 

15 130274 RC.AA085749 AA128376 Hs.153884 

110330 RC..AA098874 AI288666 Hs.16621 

114648 RC.M101056 AA101056 
IMAGE:548429 3* 

114658 RC.AA102746 AA102383 Hs.249190 

20 132456 RC..AA114250 s AB011084 Hs.48924 

131319 RC AA126561.S NWL003155 Hs.25590 

132225 RCJM128980J AA128980 
IMAGE:567164 3' 

132669 RCJW129757 W38586 Hs.293981 

25 114709 RC.AA129921 AA397651 Hs.301959 

131973 RCJWI33331 AB018284 Hs.158688 

114750 RCJVU35958 AA887211 Hs.129467 

115714 R(LAA136524_s T19228 Hs.172572 

114763 RC.AA147044 AA810755 Hs.88977 

30 114767 RC.AA148885 AI859865 Hs.154443 

114774 RC_AA150043 AV656017 Hs.184325 

129388 RC.AA151621 AA662477 Hs.1 10964 

129183 RCLM155743 BE561824 Hs.273369 

128869 RCJ\A156335 AA768242 Hs.80618 

35 130207 RCJ\A156336 AF044209 Hs.144904 

114798 RC_AA159181 AA159181 Hs.54900 

114800 RC_AA159825 Z19448- Hs.131887 
[C.elegans] 

114828 RCJW234185 AA252937 Hs.283522 

40 114846 RC.AA234929 BE018682 '» Hs,166196 

114848 RC.AA234935 BE614347 Hs.169615 

114902 RCJW236359 AW2754B0 Hs.39504 

132271 RC_AA236466 AB030034 Hs.115175 

114907 RC.AA236535 N29390 Hs.13804 

45 135159 RC_M236935_S U43374 Hs.95631 

132204 RC_M236942 AA235827 Hs.42265 

114928 RC.M237018 AA237018 Hs.94869 

132481 RC_AA237025 W93378 Hs.49614 

114932 RCJVA242751 AA971436 Hs.16218 

5 0 314162 RC_M242760 BE041820 Hs.38516 

. 131006 RCLAA242763 AF064104 Hs.22116 

114935 RCJ\A242809 H23329 Hs.290880 
• • WARNING 

' 132454 RCJ\A243133 BE296227 Hs.250822 

5 5 437754 RC.AA243495 R60366 Hs.5822 

114957 RC_AA243706 AW170425 Hs.87680 

114974 RC_AA250848 AW966931 Hs.179662 

114977 RC_AA250868 AW295978 Hs.87787 

114995 RC_AA251152 AA769266 Hs.193657 

60 115005 RC_AA251544_s AI760825 Hs.1 11339 

417177 RC_AA251792 NW1004458 Hs.81452 

131889 RC.AA252063 NM002589 Hs.34073 

115026 RC_AA252144 AA251972 Hs.188718 

115045 RC.AA252524 AW014549 Hs.58373 

65 115068 RCJ\A253461 AW512260 Hs.87767 

133138 RC_jAA255522 AV657594 Hs.181161 
RECEPTOR. 

115114 RCJ\A256468 AA527548 Hs.7527 

129584 RCJ\A256528 AV656017 Hs.184325 

70 115137 RC_AA257976 AW968304 Hs.56156 

134312 RCJ^258296 AB011151 Hs.334659 

115166 RCJK258409 AF095727 Hs.287832 

115167 RC.AA258421 AA749209 Hs.43728 
129807 RC.M262077 Y11192 Hs.5299 

75 115239 RC_jAA278650 BE251328 Hs.73291 

115243 RCJM78766 AAB06600 Hs.1 16665 



ESTs, Weakly similar to T20410 hypothetical protein E02A10.2 - Caenorhabditis elegans 
ESTs 

Homo sapiens mRNA; cDNA DKFZp566P013 (from clone DKFZp566P013) 

LIM domain only 7 

DKFZP547E2110 protein 

upstream regulatory element binding protein 1 

transcription factor 

hypothetical protein FU1 1099 

hypothetical protein FU10782 

lysosomal 

Cdc42 effector protein 3 

Homo sapiens cDNA: FU23020 6s, clone LNG00943 
ESTs 

ATP binding protein associated with cell differentiation 
DKFZP434I116 protein 

gb:zn25b03.s1 Stratagene neuroepithenum NT2RAMI 937234 Homo sapiens cDNA clone 

tumor necrosis factor receptor superfamily, member 10a 
KIAA0512 gene product; ALEX2 
stanniocalcin 1 

gb:zo09a1 1.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens cDNA clone 

guanine nucleotide binding protein (G protein), gamma 3, linked 
proline synthetase co-transcribed (bacterial homoiog) 
KIAA0741 gene product 
ESTs 

hypothetical protein FU20093 

hypothetical protein dJ511E16.2 

minichromosome maintenance deficient (S. cerevisiae) 4 

CGI-76 protein 

hypothetical protein FLJ23471 

uncharacterized hematopoietic stem/progenitor cells protein MDS027 

hypothetical protein 

nuclear receptor co-repressor 1 

serologically defined colon cancer antigen 1 

ESTs, Weakly similar to T24396 hypothetical protein T03F6.2 - Caenorhabditis etegans 

Homo sapiens mRNA; cDNA DKFZp434J1912 (from clone DKFZp434J1912) 

ATPase, Class I, type 8B, member 1 

hypothetical protein FU20989 

hypothetical protein MGC4308 

sterile-alpha motif and leucine zipper containing kinase AZK 

hypothetical protein dJ462023.2 

Human normal keratinocyte mRNA 

ESTs 

ESTs 

ESTs 

KIAA0903 protein 

Homo sapiens, clone MGC:15887, mRNA, complete cds 
CDC 14 (cell division cycle 14, S. cerevisiae) homoiog B 

ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
serine/threonine kinase 15 

Homo sapiens cDNA: FU22120 fis, clone HEP1B874 
ESTs 

nucleosome assembly protein 1-llke 1 

ESTs 

ESTs 

ESTs 

fatty-acit^Coenzyme A ligase, long-chain 4 

BH-protocadherin (brain-heart) 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA t FU14643 fis. done NT2RP2001597, weakly similar to RYANODINE 

small fragment nuclease 
CGI-76 protein 
ESTs 

hypothetical protein MGC141 39 
myelin protein zero-like 1 
hypothetical protein 

aldehyde dehydrogenase 5 family, member A1 {succlnate-semlaldehyde dehydrogenase) 
hypothetical protein FU10881 
WAA1842 protein 
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100850 RC_AA279567_s AA835472 Hs.297939 

126884 RCJU280791 U49436 Hs.286236 

115322 RC.AA280819 L08895 Hs.78995 

133626 RCLAA280828 AW836130 Hs75277 

5 115372 RCJU282195 AW014385 Hs.88678 

132825 RCJVA283127_s U82671 Hs.57698 

130269 RC.AA284694 F05422 Hs.168352 

129192 RCJW291137 AA286914 Hs.183299 

452598 RCJW29170B AI831594 Hs.68647 

10 WARNING 

132131 RCJ\A293495 AF069291 Hs.40539 

115536 RC_AA347193 AK001468 Hs.62180 

132411 RCJtt398474_s AA059412 Hs.47986 

115575 RC.AA398512 AA393254 Hs.43619 

15 115601 RCJU400277 AA148984 Hs.48849 
WARNING 

" 103928 RCLAA400896 D14540 Hs.199160 

125819 RC.AA404494 AA044840 Hs.251871 

115683 RCJVA410345 AF255910 Hs.54650 

20 115715 RCJ\A416733 BE395161 Hs.1390 

132952 RCJU425154 AI658580 Hs.61426 

' 115819 RC.AA426573 M486620 Hs.41135 

132525 RC.M431418 AW292809 Hs.50727 

115895 RCJ\A436182 AB033035 Hs.51965 

25 132333 RC_AA437099 AA192669 Hs.45032 

115962 RC_AA446585 A1636361 Hs.179520 

115967 RC.AA446887 AI745379 Hs.42911 

115974 RC.AA447224 BE513442 Hs.238944 

115985 RC_AA447709 AA447709 Hs.268115 

30 129254 RC_AA453624 AA252468 Hs.1098 

133071 RCJW455044 BE384932 Hs.64313 

116095 RC_AA456045 AA043429 Hs.62618 

122691 RC_AA460454_s R19768 Hs.172788 

' 116210 RCLAA476494 BE622792 Hs.172788 

35 116213 RCJ\A476738 AA292105 Hs.326740 

134585 RC_M481422 D14041 Hs.278573 

134790 RC_j*A462269 BE002798 Hs.287850 

116265 RC_AA482595 BE297412 Hs.55189 

129334 RC_AA485084_s AW157022 Hs.4947 

40 116274 RC_AA48543Ls AI129767 Hs.162874 

303150 RC_AA489057 AA887146 Hs.8217 

129945 RC.AA489638 BE514376 Hs.165998 

116331 RC_AA491000 N41300 Hs.71616 

116333 RC_AA491250 AF155827 Hs.203963 

45 132994 RC.AA505133 AA112748 Hs.279905 

134577 RC_AA598447 BE244323 Hs.85951 

116391 RC_AA599243 T86558 Hs75113 

116394 RCJW599574J NMJ06033 Hs.65370 

134531 RC^AA600153 A1742B45 Hs.110713 

50 116417 RCJVA609309 AW499664 Hs.12484 

116429 RC_AA609710 . AF191018 Hs.279923 

116439 RC_AA610068 AA251594 Hs.43913 

116459 RC_AA621399 R80137 Hs.302738 

427505 RC.AA621752 AA361562 Hs.178761 

55 132699 RC.C21523 AW449822 Hs.55200 

116541 RC.D12160 D12160 . Hs.249212 

132557 RC_D19708 AA114926 Hs.5122 

112259 RC_D25801 AA337548 Hs.333402 

116571 RC.D45652 D45652 

60 sequence. 

129815 RC_D60208_f BE565817 Hs.26498 

421919 RC_D80504_S AJ224901 Hs.109526 

116643 RCJTO10 AI367044 Hs.153638 

116661 RCJ : 04247 R61504 

65 repetitive 

116715 RCJ=10966 AL117440 Hs.170263 

116729 RO.F13700 BE549407 Hs.115823 

318709 RCJH05063 R52576 Hs.285280 

134760 RC_H16758 NMJ00121 Hs.89548 

70 116773 RC_H17315_s AI823410 Hs.169149 

106425 RC.H22556 H24201 Hs.247423 

116780 RC.H22566 H22566 Hs.30098 

131978 RC_H48459_s AA355925 Ks.36232 

116819 RC.H53073 H53073 Hs.93698 

75 111428 R<LH56559_s AL031428 Hs.174174 

133175 RC_H57957j» AW955632 Hs.66666 



cathepsin B 
KIAA1856 protein 

MADS box transcription enhancer factor 2, pofypepfide C (myocyte enhancer factor 2C) 

hypothetical protein FU 13910 

ESTs, Weakly similar to Unknown [H^apiens] 

Empirically selected from AFFX single probeset 

nucteoporin-like protein 1 

ESTs 

ESTs, Weakly similar to ALU7.HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

chromosome 8 open reading frame 1 

anfflin (Drosophila Scraps homolog), actin binding protein 

hypothetical protein MGC10940 

ESTs 

ESTs, Weakly similar to ALU4.HUMAN ALU SUBFAMILY SB2 SEQUENCE CONTAMINATION 

myeloid/lymphoid or mixed-iineage leukemia (trithorax (Drosophila) homolog) 
CTP synthase 

junctional adhesion molecule 2 

proteasome (prosome, macropain) subun'rt, beta type, 2 

Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 

endomucin-2 

N-acetylglucosamlnidase, alpha- (Sanfilippo disease HIB) 

KIAA1209 protein 

ESTs 

hypothetical protein MGC10702 
ESTs 

hypothetical protein FU10631 

ESTs, Weakly similar to T08599 probable transcription factor CA150 [H.sapiens] 
DKFZp434J1813 protein 

ESTs, Weakly similar to AF257182 1 G-protein-coupled receptor 48 [Ksapiens] 
ESTs 

ALEX3 protein 
ALEX3 protein 

hypothetical protein MGC10947 

H-2K binding factor-2 

integral membrane protein 1 

hypothetical protein 

hypothetical protein FU22584 

guanine nucleotide binding protein (G protein) alpha 12 

stromal antigen 2 

PAI-1 mRNA-binding protein 

Homo sapiens mRNA; cDNA DKFZp586N1720 (from clone DKFZp586N1720) 

hypothetical protein FU10339 

clone HQ0310PRO0310p1 

exportin, tRNA (nuclear export receptor for tRNAs) 

general transcription factor U1A 

lipase, endothelial 

DEK oncogene (DNA binding) 

Human done 23826 mRNA sequence 

putative nucleotide binding protein, estradioWnduced 

PIBF1 gene product 

Homo sapiens cDNA: FLJ21425 fis, clone COL04162 
26S proteasome-associated pad1 homolog 
ESTs 

polymerase (RNA) III (DNA directed) (155kD) 
ESTs 

hypothetical protein MGC12760 

gb:HUMGS02848 Human adult lung 3" directed Mbol cDNA Homo sapiens cDNA 3 1 , mRNA 

hypothetical protein FU21657 - . ■ 

zinc finger protein 198 

myeloid/lymphoid or mixed-lineage leukemia 2 

gb:yh16a03.s1 Soares infant brain 1NIB Homo sapiens cDNA clone 3' simflar to contains Alu 

tumor protein p53-binding protein, 1 

ribonuctease P, 40kD subunit 

Homo sapiens cDNA: FU22096 fis, clone HEP16953 

erythropoietin receptor 

karyopherin alpha 1 (importin alpha 5) 

adducin2(beta) 

ESTs 

K1AA0186 gene product 
EST 

K1AA0601 protein 

ESTs, Weakly similar to S19560 proline-fich protein MP4 - mouse [M.musculus] 
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116844 RC H64933 s 

116845 RC_H64973 
116892 RCLH69535 
116925 RC H73110 

5 116981 RCIH81783 
131768 RCJI86259 
117031 RC_H68353 
contains L1 
117034 RC.H88639 

10 132542 RC_H88675 
134403 RC_H93708_s 
117280 RCJI22107 
117344 RC_N24046 
117422 RC.N27028 

15 117475 RC N30205 
117487 RCJJ30521 
130207 RC.N33258 
• ^ 117549 RC.N33390 

• •'117683 RC.N40180 

20 1MAGE:276387 3' similar to 
117710 RC1N45198 
104514 RCJM5979_s 
- 117791 RC_N48325 
117822 RCJW8913 

25 129647 RCJI49394 
117895 RC.N50656 
(H.sapiens] 
131557 RCJI50721 
133057 RCJ153143 

30 118103 RCJJ55326 
118111 RC.N55493 
mRNA 

118129 RCJI57493 
IMAGE:277358 3', mRNA 
35 118278 RC.N62955 
118329 RC.N63520 
3', mRNA 

118336 RCJI63604 
132457 RCJI64166 
40 118363 RC N64168 
118364 RCJJ64191 
118475 RCJI66845 • 
similar to 

118491 RCJJ67135 - 
45 118500 RC_N67295 

101663 RC_N68399 

118584 RC.N68963 

sequence . 

421983 RCJJ69331 
50 116661 RC_N70777 

118684 RC_N71364_s 

118689 RCJN7154SJS 

118690 RCJY71571 
118766 RCJJ74456 

55 118793 RC_N75594 
118817 RC_N79Q35 
118844 RC_N80279 
118919 RC.N91797 
129558 RCJI92454 

60 132692 RC.N94581 
118996 RC.N94746 
119021 RC.N98238 
119039 RCR02384 
119063 RC.R16833 

65 WARNING 

118523 RC_R41828_S 
119111 RC.R43203 
133970 RC.R46395 
119146 RC.R58863 

70 120296 RC_R78248 
119239 RCJN1483 
sequence. 

119281 RC.T16896 
119298 RCJT23820 
75 126502 RC.T30222 

135073 RCJIV15275JB 



H64938 Hs.337434 ESTs, Weakly similar to A46010 X-iinked retinopathy protein [H.sapiens] 

AA649530 gb:ns44t05.s1 NCLCGARjAM Homo sapiens cDNA clone, mRNA sequence 

AI573283 Hs.38458 ESTs 

H731 10 Hs.260603 ESTs, Moderately similar to A475B2 B-ceD growth factor precursor [Haptens] 

N29218 Hs.40290 ESTs 

AC005757 Hs.31809 hypothetical protein 

H88353 gb:yw21a02.s1 Morton Fetal Cochlea Homo sapfens cDNA clone IMAGE:252842 3' similar to 

U72209 Hs.180324 YYI-assodated factor 2 

AL137751 Hs.263671 Homo sapiens mRNA; cDNA DKFZp434l0812 (from clone DKF2p434l0812); partial cds 

AA334551 Hs.82767 sperm specific antigen 2 

M18217 Hs.172129 Homo sapiens cDNA: FU21409 fe, done COL03924 

R19085 Hs.210706 Homo sapiens cDNA FLJ13182 fis, clone NT2RP3004070 

AI355562 Hs.43880 ESTs, Weakly similar to A46010 X-linked retinopathy protein [H,sapfens] 

N30205 Hs.93740 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 

N30621 Hs.44203 ESTs 

AF044209 Hs.144904 nuclear receptor oo-repressor 1 

N33390 Hs.44483 EST 

N40180 gb:yy44d02.s1 Soares_muttip!e_sderosis_2NbHMSP Homo sapiens cDNA clone 

N45198 Hs.47248 ESTs, Highly similar to similar to Cdc14B1 phosphatase [H.sapiens] 

AF164622 Hs.182982 golgin-67 

N48325 Hs.93956 EST 

AA706282 Hs.93963 ESTs 

AB018259 Hs.1 18140 K1AA0716 gene product 

AW450348 Hs.93996 ESTs, Highly similarto SORU.HUMAN SORTILIN-RELATED RECEPTOR PRECURSOR 

AA317439 Hs.28707 signal sequence receptor, gamma (translocon-associated protein gamma) 

AA465131 Hs.64001 Homo sapiens clone 2521 8 mRNA sequence 

AA401733 Hs.184134 ESTs 

N55493 ■ gb:yv50c02.s1 Soares fetal Ever spleen 1NFLS Homo sapiens cDNA clone IMAGE:246146 3', 

N57493 ' gb:yy54c08^1 SoaresjnuItipte_sclerosis_2NbHMSP Homo sapiens cDNA clone 

N62955 " Hs.316433 Homo sapiens cDNA FU11375 fis, clone HEMBA1 000411, weakly similar to ANKYRIN 

N63520 gb:yy62f01.s1 SoaresjnultiplejsclercsisJNbHMSP Homo sapiens cDNA clone IMAGE:278137 

BE327311 Hs.47166 HT021 

AB017365 Hs.173859 frizzled (Drosophiia) homolog 7 

AI183838 Hs.48938 hypothetical protein FU21 802 

N46114 Hs.29169 hypothetical protein FU22623 

N66845 gb:za46c1 1 .s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone IMAGE:295604 3' 

AV647908 Hs.90424 . Homo sapiens cDNA: FU23285 fis, clone HEP09071 

W32889 Hs.154329 ESTs 

NNL003528 Hs.2178 H2B histone family, member Q 

AW1 36928 gb:UJ-H-BI1-adp-<W)6-0-UKs1 NCLCGAP_Sub3 Homo sapiens cDNA clone 3', mRNA 

AI252640 Hs.1 10364 peptldylproryl Isomerase C (cyclophilin C) 

AL137554 Hs.49927 protein kinase NYD-SP15 

N71313 Hs.163986 Homo sapiens cDNA: FU22765 fis, done KAIA1 180 

AW390601 Hs.184544 Homo sapiens, clone IMAGE:3355383, mRNA, partial cds 

N71571 Hs.269142 ESTs 

N74456 Hs.50499 EST 

N75594 . Hs.285921 ESTs, Moderately similar to T47135 hypothetical protein DKFZp761L0812.1 [H.sapiens] 

AI668658 Hs.50797 ESTs 

AL035364 Hs.50891 hypothetical protein 

AW452696 Hs.130760 myosin phosphatase, target subunit 2 

AW580922 Hs.180446 karyopherih (importin) beta 1 

AW191962 Hs.249239 collagen, type VI!!, alpha 2 

N94746 Hs.274248 hypothetical protein FU20758 

N98238 Hs.55185 ESTs 

AI160570 Hs.252097 pregnancy specific beta-1 -glycoprotein 6 

R16833 Hs.53106 ESTs, Moderately similar to ALU1JWMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Y07759 Hs.170157 myosin VA (heavy polypeptide 12, myoxin) 

T02865 Hs.328321 EST 

AA214228 Hs.127751 hypothetical protein 

R58863 Hs.91815 ESTs 

AW995911 Hs.299883 hypothetical protein FLJ23399 

T1 1483 gb:CHR90049 Chromosome 9 exon Homo sapiens cDNA done 1 1 1 -1 5* and 3', mRNA 

AI692322 Hs.65373 ESTs, Weakly similar to T02345 hypothetical protein K1AA0324 [Rsapiens] 

NMJXJ1241 Hs.155478 cydinT2 

T10077 Hs.13453 hypothetical protein FU14753 

W55956 Hs.94030 Homo sapiens mRNA; cDNA DKFZp586E1624 (from clone DKFZp586E1624) 
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119558 


RCJV38194 


W38194 




132736 


RC_W42414_S 


AW0B1883 


Hs.288261 


sapiens mad protein 






132173 


RCW46577 s 


X89426 


Hs.41716 


134873 RC_W49632_s 


AA884471 


Hs.90449 


119650 RC W57613 


R82342 


Hs.79856 


119654 RC W57759 


W57759 




oil i men U 

119683 


RCJV61118 


W65379 


Hs.57835 


119694 


RC~W65344 


AA041350 


Hs.57847 


119718 


RClW69216 


W69216 


Hs.92848 


133010 


RC.W69379 


AI287518 


Hs.62669 


119938 


RCW86728 


AW014862 


Hs.58885 


120128 


RCIZ38499 


BE379320 


Hs.91448 


120130 


RC_Z38630 


AA045767 


Hs.5300 


120148 


RO.Z39494 


F02806 


Hs.65765 


120155 


RC_Z39623 


Z39623 


Hs.65783 


131486 


RC Z40071_s 


F06972 


Hs.27372 


120163 


RC Z40174 


AW082866 


Hs.65882 


120184 


RC Z40182 


Z40182 


Hs.65885 


120211 


RC^Z40904 


Z40904 


Hs.66012 


120245 


RCJVA166965 


AW959515 


Hs.1 11045 


120247 


RC^AA167500 


AA1 67500 


Hs.103939 


120254 


RCJ\A169599_s 


W90403 


Hs.1 11054 


120259 


RCJ\A171724 


AW014786 


Hs.192742 


120260 


RC.AA171739 


AK000061 


Hs.101590 


120275 


RC.M177105 


AA177105 


Hs.78457 


120284 


RCJW82626 


AA179656 




to contains 






114056 RC_AA186324 


AA188175 


Hs.82506 


129507 RCJVA192099 


AJ236885 


Hs.112180 


120302 RC.AA192173 


AA837098 


Hs.269933 


120303 


RC_AA192415 


AI216292 


Hs.96184 


120305 


RC_AA192553 


AW295096 


Hs.101337 


120319 


RCJW194851 


T57776 


Hs.191094 


133389 


RC_AA195520_s 


AA195764 


Hs.72639 


120326 


RCJW96300 


AA196300 


Hs.21145 


134272 


RCJW 96517 


X76040 


Hs.278614 


133145 RC^AA196549 


H94227 


Hs.6592 



120327 RC_AA196721 AK000292 Hs.278732 
106686 RC_AA195729_i N66397 Hs.334825 

120328 RC_AA196979 AA923278 Hs.290905 
120340 RC_AA206828 AA206828 

similar to 

134292 RC^AA207123 AI906291 Hs.81234 

131522 RC_AA214539J AI380040 Hs.239489 

.129051 RC_AA226914_s AA227068 Hs.108301 

120375 RC.AA227260 AF028706 Hs.111227 

120376 RC.AA227469 AA227469 
IMAGE:663732 3', mRNA sequence. 

120390 RCJW233122 AA837093 " ■ Hs.1 11460 

303876 RCJVA233334_s U64820 Hs.66521 
dominant, ataxin 3) 

132038 RC_jAA233347 AI825842 Hs.3776 

104463 RCJ\A233519 T85825 Hs.246885 

125750 RCJ\A233714 AA018515 Hs.264482 

120396 RC.AA233796 AA134006 Hs.79306 

120409 RC_AA235050J AA235050 
gb:L07077 

120414 RC.AA235704 AW137156 Hs.181202 

120420 RC_AA236031 AI128114 Hs.112885 

120422 RC.AA236352 AL133097 Hs.301717 
132221 RCUAA236390.S W94915 Hs.42419 

120423 RC J AA236453 AA236453 Hs.18978 
120435 RCJ\A243370 AA243370 Hs.96450 
120453 ROjAA250947 AA250947 Hs.170263 

120455 RCJ\A251083 AA251720 Hs.104347 

120456 RCJ\A251113 M488750 Hs.88414 
120473 RC_jAA251973 AA251973 Hs.269988 
128922 RCM252023 AI244901 Hs.9589 
120477 RCJ\A252414 AA252414 Hs.43141 
120479 RC J AA252650 AF006689 Hs.110299 
120488 RO.AA255523 AW952916 Hs.63510 
120510 RC.AA258128 AI796395 Hs.111377 

120527 RCJ\A262105 AA262105 Hs.4094 

120528 RC_AA262107 AI923511 Hs.104413 



Empirically selected from AFFX single probeset 

Homo sapiens cDNA: FU23037 fls, done LNG02036, highly similar to HSU68019 Homo 

endothelial cell-specific molecule 1 
Human clone 23908 mRNA sequence 

ESTs, Weakly similar to S65657 aipha-1 (Adrenergic receptor splice form 2 [H.sapiens] 
gb:zd20g1 1 .s1 Soares_fetaLhearLNbHH19W Homo sapiens cDNA clone IMAGE:341252 3* 

ESTs 

ESTs, Moderately similar to ICE4JHUMAN CASPASE-4 PRECURSOR [H.sapiens] 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586D0923 (from done DKFZp586D0923) 
ESTs 

MKP-1 like protein tyrosine phosphatase 
bladder cancer associated protein 
ESTs 
ESTs 

BMX non-receptor tyrosine kinase 

ESTs 

EST 

EST 

ESTs 

EST 

ESTs 

hypothetical protein FU12785 
hypothetical protein 

solute carrier family 25 (mitochondrial carrier; ornithine transporter) member 15 

gb;zp54e1 1 .si Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA done 3' similar 

KIAA1254 protein 

zinc finger protein 148 (pHZ-52) 

ESTs 

ESTs 

uncoupling protein 3 (mitochondrial proton carrier) 

ESTs 

ESTs 

hypothetical protein RG083M05.2 
protease, serine, 15 

Homo sapiens, clone IMAGE:2961368, mRNA, partial cds 
hypothetical protein FLJ20285 
Homo sapiens cDNA FU14752 fis, done NT2RP3003071 
ESTs, Weakly similar to protease [H.sapiens] 

gb:zq80b08.s1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone IMAGE:647895 3' 

immunoglobufin superfamOy, member 3 

T1A1 cytotoxic granule-associated RNA-binding protein 

nudear receptor subfamily 2, group C, member 1 

Zic family member 3 {odd-paired Drosophila homolog, heterotaxy 1) 

gb:zr18a07.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA done 

calcium/calmodulin-dependent protein kinase (CaM kinase) II delta 

Machado-Joseph disease (spinocerebellar ataxia 3, onvopontocerebellar ataxia 3, autosomal 

zinc finger protein 216 
hypothetical protein FU20783 

Homo sapiens mRNA; cDNA DKf=Zp761AQ411 (from clone DKFZp761A0411) 
eukaryotic translation Initiation factor 4E 

gb:zs38e04.s1 Soares_NhHMPu_S1 Homo sapiens cDNA done IMAGE:687486 3' similar to 

hypothetical protein FU10038 
spinal cord-derived growth factor-B 
hypothetical protein DKFZp434N1926 
ESTs 

Homo sapiens cDNA: RJ22822 fis, clone KAIA3968 
EST 

tumor protein p53-blnding protein, 1 

ESTs, Weakly similar to ALUC_HUMAN Hi! ALU CLASS C WARNING ENTRY ill [Ksapiensl 
BTB and CNC homology 1, basic leudne zipper transcription factor 2 
ESTs 
ubiquilin 1 

DKFZP727C091 protein 
mitogen-activated protein kinase kinase 7 
K1AA0141 gene product 
ESTs 

Homo sapiens cDNA FU14208 fis, done NT2RP3003264 
ESTs 
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AW987495 
AA178953 

AW978022 
AB020686 



Z11773 

AA306166 

AW978721 

AI752244 

AI241084 

AA284371 



BE617858 

AW961294 

AA286844 

AW172431 

AI869129 

AA286785 



120529 RC .M262235 A1434823 
120541 RC J AA278298 W07318 
131445 RCJVA278529J NWL014264 
120544 RC_jAA278721 • BE548277 
5 120562 RC_AA280036 BE244580 
120569 RCJW280648 AA807544 

120571 RC.AA280738 AB037744 

120572 RCJ^A280794 
129434 RCJ\A280837 

10 130529 RCJ^A280886 
repetitive 

120575 RC.M280934 

132635 RCJ^A281535 

120591 RCLAA281797JS AF078847 
15 120593 RCLAA282047 AA748355 

430275 RCJVA283002 

117729 RCJW283709 

120609 RCJW283902 

132754 RC.AA284108 
20 130315 RC_AA284109 

132614 RCJ^A284371 

447503 RCJVA284744J AA1 15496 

cds 

135376 RC_AA284784 
25 120621 RCJUV284840 
107866 RCLAA286B44 
129868 RC.jAA287032 
120644 RC_jAA287038 

120660 RCLAA287546 
30 135370 RC_AA287553_S BE622187 

120661 RC AA287556 AA287556 
129116 RC_AA287564 AB019494 
131567 RCJ\A291015_s AF015592 
120699 RCJ\A291716 AI683243 

35 100690 RC_AA291749_S AA383256 
120726 RC AA293656 AA293655 
- 120737 RC AA302430 AL049176 
120745 RC_AA302809 AA302809 
135192 RC.jAA302820_s U83993 

40 120750 RCJ\A310499 AI191410 
120761 RC AA321890 AA321890 

120768 RCJ\A340589 AA340589 

120769 RCJ\A340622 AI769467 
135232 RO.AA342457J AL038812 

45 CONTAMINATION 

. 133439 RC_AA342828_s Z23091 
120793 RC.AA342864 AA342864 
120796 RC^AA342973 AI247356 
120809 RC.jAA346495 AA346495 

50 repeat, mRNA sequence. * . 

. 132459 RC.AA347573 AL1 20071 
120825 RC AA347614 AI280215 
120827 RC AA347717 AA382525 
120839 RClAA348913 AA348913 

5 5 repeat mRNA sequence. 

120850 RC^AA349647 AA349647 
120852 RCJ\A349773 AA349773 
128852 RC^AA35054Ls R40622 
135240 RC.AA357159J AA357159 

60 120870 RC.AA357172J AA357172 
WARNING 

134637 RC_AA369856_s U87309 
120894 RCJW370132 AA370132 
131854 RCJW370472J5 AF229839 
65 120897 RC.AA370867 AA370867 
120915 RC_AA377296 AL135556 

120935 RC.AA383902 AL048409 
WARNING 

120936 RCJ\A385934 AA385934 
70 120937 RC_AA386255 AA386255 

120938 RCJA386260 AA386260 
129722 RC^AA386266 R20855 
120960 RCJ\A398014 AA398014 
120985 RCJ\A398222 A1219896 
75 120988 RC^AA398235 AA398235 



Hs.104415 

Hs.240 

Hs.172052 

Hs.103104 

Hs.302267 

Hs.24970 

Hs.34892 

Hs.294008 

Hs.166644 



Hs.238911 

Hs.54037 

Hs.191356 

Hs.193522 

Hs.237786 

Hs.7145 

Hs.266076 

Hs.75309 

Hs.154353 

Hs.1 18064 

Hs.336898 

Hs.99756 ■ 

Hs/143818 

Hs.61260 

Hs.13012 

Hs.96616 

Hs.99677 

Hs.99670 

Hs.263412 

Hs.225767 

Hs.28853 

Hs.97258 

Hs.1657 

Hs.97293 

Hs.82223 

Hs.321709 
Hs.96693 
Hs.1265 
Hs.104560 



Hs.96800 

Hs.73734 
Hs.96812 
Hs.96820 



Hs.48998 
Hs.96885 
Hs.132967 



Hs.96927 

Hs.191564 

Hs.105601 

Hs.96986 

Hs.292581 

Hs.180941 

Hs.97063 

Hs.173202 

Hs.97079 

Hs.97104 

Hs.97177 

Hs.97184 

Hs.97186 

Hs.104632 

Hs,5422 

Hs.104684 

Hs.97592 

Hs.97631 



ESTs 

M-phase phosphoprotein 1 
serine/threonine kinase 18 
ESTs 

hypothetical protein FU10330 

ESTs, Weakly similar to B34323 GTP-binding protein Rab2 (H.sapiens] 

KIAA1323 protein 

ESTs 

ESTs 

gb2p39e03.s1 Stratagene muscle 937209 Homo sapiens cDNA clone 3" similar to contains Alu 

hypothetical protein DKFZp762E1511; KIAA1816 protein 
ectonudeotide pyrophosphatase/phosphodiesterase 4 (putative function) 
general transcription factor IIH, polypeptide 2 (44kD subunit) 
ESTs 

zinc finger protein 187 
caipain 7 

ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens] 
eukaryotic translation elongation factor 2 *- 
nonselective sodium potassium/proton exchanger 
similar to rat nuclear ubiquitous casein kinase 2 

Homo sapiens, Similar to RIKEN cDNA 1810038N03 gene, clone MGC:9890, mRNA, complete 

mitochondrial ribosome recycling factor 

hypothetical protein FU23459 

hypothetical protein FU13164 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 

ESTs, Weakly similar to ALUB.HUMAN ill! ALU CLASS B WARNING ENTRY !!! [H^apiens] 

IDN3 protein 

COC7 (cell division cycle 7, S. cerevisiae, homologate 1 

ESTs, Moderately similar to S29539 ribosomal protein L13a, cytosolic [Rsapiens] 

estrogen receptor 1 

ESTs 

chordin-fike 

gb:EST10426 Adipose tissue, white I Homo sapiens cDNA 3* end, mRNA sequence. 

purinerglc receptor P2X. figand-gated ion channel, 4 

ESTs, Moderately similar to 2109260A B cell growth factor [Rsapiens] 

branched chain keto acid dehydrogenase E1, beta polypeptide (maple syrup urine disease) 

EST 

ESTs 

ESTs. Moderately similar to ALU7JWMAN ALU SUBFAMILY SQ SEQUENCE 

glycoprotein V (platelet) 

ESTs 

ESTs 

gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end similar to EST containing O family 

fibronectin leucine rich transmembrane protein 2 
ESTs 

Human EST clone 122887 mariner transposon Hsmarl sequence 

gb:EST55442 Infant adrenal gland II Homo sapiens cDNA 3' end similar to EST containing Alu 

Homo sapiens cDNA FU12573 fis, clone NT2RM4000979 
ESTs 

ESTs " - . 

EST 

ESTs, Moderately similar to ALU INHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

vacuolar protein sorting 41 (yeast homolog) 
ESTs 

l-kappa-B-mteracfing Ras-fike protein 1 

ESTs, Moderately similar to AF174605 1 F-box protein Fbx25 [H.sapiens] 
ESTs 

ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

EST, Highly similar to (defline not available 7499603) [C.elegans] 

EST 

EST 

glycoprotein M6B 

EST 

ESTs 

ESTs 
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121008 RCLAA398348 
GSSs and a CpG 
121029 RC.AA398482 
121032 RCJM398504 
5 121033 RCJW598505 

121034 RC.AA398507 

121035 RC.AA398523 
121058 RCJW398625 
121060 RCJW398632 

10 121061 RCJW398633 

121091 RCJ\A398894 
CONTAMINATION 

121092 RCJVA398895 
121094 RCLAA398900 

15 121096 RCJW398904 
121115 RCJ\A399122 

121121 RCJ\A399371 

121122 RC_AA399373 
121125 RCJW399441 

20 121151 RCJVA399636 
121153 RCJ\A399640 
121163 RC.AA399680 
121176 RC_AA400080 
121192 RC_AA400262 

25 121223 RCJW00725 
121227 RCJVA400748 
121231 RC.AA400780 

121278 RCJW01631 

121279 RCJ\A401688 
30 121282 RCJW01695 

121299 RCJW402227 

121301 RCJ\A402329 

121302 RCJW402398 
121304 RCLAA402449 

35 121305 RCJW402468 
134721 RCLAA403268. 

121323 RC.jAA403314 

121324 RC.AA404229 
129047 RCJWI04260 

40 131074 RC>A404271 
121344 RC_AA405026 
121348 RCJ\A405182 
121350 RC.AA405237 
contains Alu 

45 121400 RCJA406061 
121402 RCJ\A406063 
121403. RCJ\A406070 
. 121408 RC_AA406137 
121431 RC_AA406335 

50 132936 RCJ\A411801 
• 121471 RC_AA411804 
121474 RCLAA411833 
121526 RC_AA412219 
121530 RCj\A412259 

55 121558 RO.AA412497 
contains L1.I3L1 
121559 RCLAA412498 
121584 RCJWI16586 
121609 RCJWI16867 

60 121612 RCLAA416874 
121737 RC.AA421133 
121740 RC.AA421138 
129194 RC.AA422079 
121784 RCLAA423B37 

65.. 121802 RC_AA424328 
121803 RCLAA424339 
135286 RCJ\A424469_ 
121806 RCJW24502 
129517 RCLAA425004 

70 121845 RC^AA425734 
CONTAMINATION 
121853 RCJ\A425887 
121891 RC AA426456 
121895 RC.AA427396 

75 similar to contains 

121899 RCJW427555 



AA398346 Hs.301720 Human DNA sequence from done RP1 1-251 J8 on chromosome 13 Contains ESTs, STSs, 



AA398482 
AA393037 
AA398505 
AL389951 
AA398523 
AA398625 
AA398632 
AA393288 
AA398894 

AA398895 
AA402505 
AA398904 
AA398187 
AA399371 
AI126713 
AL042981 
AA399636 
AA399640 
AI676062 
AL121523 
AA400262 
AI002110 
AA400748 
AA81494B 
AA037121 
AA292873 
AA401695 
AA402227 
NM.006202 
AA402587 
AA293863 
AA402468 
s AK000112 
AA291411 
AA404229 
AI768623 
U16125 
AA405026 
AA405182 
AA405237 

AA406061 
AA406063 
AA406070 
AA406137 
AA035279 . 
AL120659 
AA411804 
AA402335 
AW665325 
AA778658 
AA412497 

AI192044 
AI024471 
AA416867 
AA416874 
AA421133 
AA421138 
, AA150797 
T90789 
AI251870 
AI338371 
s AW023482 
AA424313 
AW972853 
AI732692 

AA425887 
AA426456 
AA427396 

R55341 



Hs.97641 EST 

Hs.161798 ESTs 

Hs.97360 ESTs 

Hs.271623 nucleoporin 50kD 

Hs.210579 ESTs 

Hs.97391 ESTs 

Hs.97395 ESTs 

Hs.97396 ESTs 

Ks.97657 ESTs, Moderately similar to ALU8 JWMAN ALU SUBFAMILY SX SEQUENCE 

Hs.97658 EST 



Hs.332690 

Hs.104682 

Hs.189095 

Hs.192233 

Hs.251278 

Hs.143629 

Hs.97694 

Hs.1 11902 

Hs.97774 

Hs.190093 

Hs.97169 

Hs.97823 

Hs.96343 

Hs.98518 

Hs.177996 

Hs.97334 

Hs.22826 

Hs.89901 

Hs.325520 

Hs.97316 - 

Hs.291557 

Hs.89306 

Hs.97247 

Hs.97842 

Hs.108264 

Hs.181581 ' 

Hs.193754 

Hs.97973 



Hs.98001 

Hs.98003 

Hs.98004 

Hs.98019 

Hs.176731 

Hs.6111 

Hs.261575 

Hs.188760 

Hs.98120 

Hs.98122 



Hs.104778 

Hs.98232 

Hs.98185 

Hs.98168 

Hs.104671 

Hs.98334 

Hs.109276 

Hs.94308 

Hs.188898 

Hs.157173 

Hs.97849 

Hs.98402 

Hs.112237 

Hs.165066 

Hs.98502 
Hs.98469 



gb2t62h10.r1 Soares testis NHT Homo sapiens cONA clone 5\ mRNA sequence 
ESTs 

ESTs, Weakly similar to mitochondrial citrate transport protein [Rsapiens] 
similar to SALL1 (sal (Drosophlla)-like 

ESTs, Highly similar to T00337 hypothetical protein KIAA0568 [Rsapiens] 

KIAA1201 protein . 

ESTs 

ESTs . 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to dJ667H1 2.2.1 [H.sapiens] 

Homo sapiens mRNA; cDNA DKFZp434D024 (from clone DKFZp434D024) 

ESTs, Weakly simflar to ALUC.HUMAN III! ALU CLASS C WARNING ENTRY III [H.sapiens] 

Homo sapiens cDNA FU 11490 tis, clone HEMBA1001918 

ESTs 

ESTs 

tropomcdulin 3 (ubiquitous) 

phosphodiesterase 4A, cAMP-specific (dunce (Drosophila)-homoIog phosphodiesterase E2) 

LAT1-3TM protein 

EST 

ESTs 

hypothetical protein FU20105 

ESTs 

EST 

ESTs 

glutamate receptor, ionotropic, kainate 1 

ESTs 

ESTs 

gb:zt06e10.s1 NCLCGAPJ3CB1 Homo sapiens cDNA clone IMAGE:71 2362 3' simflar to 

EST 

ESTs 

EST 

EST 

ESTs 

aryl-hydrocarbon receptor nuclear translocator 2 
ESTs 

ESTs, Highly similar to Trad [Rsapiens] 

ESTs 

ESTs 

gb:zt95g1 2.s1 SoaresJestls.NHT Homo sapiens cDNA clone IMAGE:7301 50 3' similar to 

ESTs 
ESTs 
EST 
ESTs 

erythrocyte transmembrane protein 
EST 



RAB35, member RAS oncogene family 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs. Moderately similar to ALU2.HUMAN ALU SUBFAMILY SB SEQUENCE 

hypothetical protein FU14303 
ESTs 

gbzw33a02^1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:771050 3' 



Hs.50421 KIAA02Q3 gene product 
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121917 RC_AA428218 AA406397 

121918 RCJM28242 BE274689 

121919 RCJVA428281 AA428281 
121941 RCJ\A428865 AA428855 

5 121942 RCJK428994 AW452701 

121970 RCJ\A429666 M429666 

121993 RCJW4301B1 AW297880 
134660 RC^AA430184_s U73524 
126753 RCJ\A431288_S AA305478 

10 122022 RCJU431293 AA431293 

122050 RCJUV431478 AI453076 

122051 RCJWI31492 AA431492 
122055 RCJ\A431732 AA431732 
122105 RC.AA432278 AW241685 

15 122125 RCJ\A434411 AK000492 

135235 RC_M435512_i AW298244 

122162 ' RC.AA435698 AA628233 

129406 RC.AA435711 AB018255 
318801 RCJ\A435815_s U40763 

20 122186 RC_ J AA435842 AA398811 

122235 RC AA436475 AA436475 

129131 RCJ\A436489 AB026436 

134664 RC AA442060 AA256106 

122310 RCJM42079 AW192803 

25 122334 RC_AA443151 BE465894 

122382 RCLAA446133 AA446440 

122425 RCLAA447145 AB007859 

122431 RCJ\A447398 M447398 

122450 RC^AA447643 AA447643 

30 302653 RCJ\A447742_s AJ404468 

122477 RC_AA446226 AA448226 

122500 RCJW48825 AA448825 

122522 RC.AA449444 AA299607 

122536 RCJW50Q87 AF060877 

35 122538 RC.AA450211 AA450211 

122540 RC^AA450244 AA476741 

122560 RCLAA452123 AW392342 

421919 RCJ\A452155 AJ224901 

122562 RCJ\A452156 AA452156 

40 mRNA 

122585 RC_jAA453036 A1681654 

122608 RC.AA453526 AA453525 

122635 RC.AA454085 AA454085 
similarto 

45 122636 RC_AA454103 AW651706 

122653 RCJUV454642 AW009166 

122660 RC.AA454935 AI816827 

" , " 122703 RCJVA456323 AA456323 

122724 RCJ\A457395 AA457395 

50 122749 RCLAA458850 M458850 

122772 RCLAA459662 AW1 17452 

131098 RCJ\A459668 U66669 
129045 RC_AA459679_s AI082883 

122777 RCLAA459702 AKD01022 

55 135362 RCLAA460017J AA978128 

122798 RCJM60324 AW366286 

122837 RCLM461509 AA451509. 

122860 RCJ\A464414J AA464414 
mRNA sequence. ■ 

60 122861 RCJ\A464428 AA335721 

122910 RC.AA470084 AA470084 
132899 RC - AA476606_s AA476606 

122967 RCJ\A478521 AA806187 

129560 RCJW478523 AA317841 

65 123009 RCLAA479949 AA535244 

128917 RCJM81252 A1365215 

123081 RCJ\A485351 AI815486 

123133 RC.M4B7264 AA487264 

123184 RCJW89072 BE247767 

70 129671 RCJW489630 NM.014700 

123233 RCJW490225 AW974175 
[H.sapiens] 

123234 RCJW90227 NM.001938 
123236 RCJ\A490255 AW968504 

75 123255 RCJW490890 AA830335 
129503 RCJW49Q916_s AW768399 



Hs.98038 ESTs 

Hs.184175 chromosome 2 open reading frame 3 

Hs.98560 EST 

Hs.98563 ESTs 

Hs.293237 ESTs 

Hs.98617 EST 

Hs.98661 ESTs 

Hs.87465 ATP/GTP-binding protein 

Hs.95327 CD3D antigen, delta polypeptide (TiT3 complex) 

Hs.98716 ESTs, Moderately similarto T42650 hypothetical protein DKFZp434D0215.1 [RsaplensJ 

Hs.166109 ELAV (embryonic lethal, abnormal vision, DrosophiIa)-IIke 2 

Hs.98742 EST 

Hs.98747 EST 

Hs.98699 ESTs 

Hs.98806 hypothetical protein 

Hs.293507 ESTs 

Hs.79946 cytochrome P450, subfamily XIX (aromatization of androgens) 

Hs.111138 KIAA0712 gene product . 

Hs.77965 peptidyi-prolyl Isomerase G (cydophiBn G) 

Hs.104673 ESTs 

Hs.1 12227 membrane-associated nucleic acid binding protein 

Hs.177534 dual specificity phosphatase 10 

Hs.87507 ESTs 

Hs.98974 ESTs, Weakly similar to S65824 reverse transcriptase homoiog [H.sapiensl 

Hs.98365 ESTs, WeaWy similar to LB4DJWMAN NADP-DEPENDENT LEUKOTR1ENE B4 12- 

Hs.98643 ESTs 

Hs.100955 KIAA0399 protein 

Hs.99104 ESTs 

Hs.112095 hypothetical protein DKF2p434F1819 

Hs.284259 dynein, axonemal, heavy polypeptide 9 

Hs.324123 ESTs 

Hs.99190 ESTs 

Hs.98969 ESTs 

Hs.99236 regulator of G-protein signalling 20 

Hs.99239 ESTs 

Hs.98279 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.sapiens] 

Hs.283077 centrosomal P4. 1-assoclated protein; uncharacterized bone marrow protein BM032 

Hs.109526 zinc finger protein 198 

gb:zx29c03.s1 Soares JotaLfetus J4b2HF8_9w Homo sapiens cDNA clone IMAGE:787876 3', 

Hs.170737 hypothetical protein FU23251 

Hs.143077 ESTs 

gbzx33a08.s1 Soares_totalJetusJ!b2HF8_9w Homo sapiens cDNA clone IMAGE:788246 3' 

Hs.99519 hypothetical protein FU14007 

Hs.99376 ESTs 

Hs.1 80069 nuclear respiratory factor 1 

Hs.269369 ESTs 

Hs.99457 ESTs . 

Hs.293372 ESTs, Weakly simUar to B34087 hypothetical protein [H^apiens] 

Hs.99489 ESTs 

Hs.236642 3-hydroxyisobutyryl-Coenzyme A hydrolase 

Hs.30732 hypothetical protein FL) 13409; KIAA171 1 protein 

Hs.214397 hypothetical protein FU10160 similar to insulin related protein 2 

Hs.99513 ESTs, Weakly similarto T17454 diaphanous-related formin - mouse [M.musculus] 

Hs.145696 splicing factor (CC1.3) 

Hs.293565 ESTs, Weakly similarto putative p150 [H.sapiens] 

gb:zx78g01.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:809904 3', 

Hs.119394 ESTs 

Hs.98358 ESTs 

Hs.59666 SMAD in the antisense orientation 

Hs.289101 glucose regulated protein, 58kD 

Hs.7845 hypothetical protein MGC2752 

Hs.78305 RAB2, member RAS oncogene family 

Hs.206097 oncogene TC21 

Hs.243901 Homo sapiens cDNA FU20738 fis, clone HEP08257 

Hs.154974 Homo sapiens mRNA; cDNA DKFZp667N064 (from clone DKFZp667N064) 

Hs.18166 KIAA0870 protein 

Hs.1 19004 KIAA0665 gene product 

Hs.188751 ESTs, Weakly similar to MAPB_HUMAN MICROTUBULE^SSOCIATED PROTEIN 1B 

Hs.16697 dowwegulator of transcription 1 , TBP-binding (negative cofactor 2) 
Hs.123073 CDC2-refated protein kinase 7 
Hs.105273 ESTs 
Hs.1 12157 ESTs 
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131043 RC_AA490925 
123259 RCJW90955 
[H .sapiens] 

123284 RCLAA495812 
123286 RCJW495824 
123315 RCJ^496369 
to contains 

129179 RC_AA504125_s 
131612 RCJW521473 
123421 RC.M598440 
123449 RCJW598899J 
129021 RC_jAA599244 
132830 RC_ J AA599694_s 
123497 RC _AA600037 
123604 RC_AA609135 
129539 RCJ\A609582 
123712 RC_AA6Q9684 
123731 RCJW609839 
simitar to 

130725 RCJ\A609862 
123800 RC_AA620423 
123841 RCJ^A620747 
123929 RCL.AA621364 
123978 RC.C20653 
133184 RCJD20085 
132835 RCJD20749 
132406 RC_D51285_s 
128695 RCJ359972J 
124028 RC.F04112J 
sequence. 

124057 RC F13604 
134899 RC.H01662 
130973 RC.H05135J 
RC.H12245 
RC_H22842 
RC_H30894 
RC_H4344*_s 
RCLH45996 
RC_H69281J 
RC_H69485_f 
RC_H69899 



124106 
124136 
124165 
131229 
124178 
129948 
134374 
124254 
similar to 

129056 RC.H70627.S 
100919 RC_H73050_s 
130724 RCJ473260 
100716 RC_H77531Js 
124274 RC_HB0552 
129078 RC_H80737_S 
124828 RC_H93412 
124315 RC_H94892_s 
RC_H95643_s 
RC_H96552 
RC_H97146 
RC_H99131_s 
RC_H99462L_s 
RC_H99837_S 
RC N22140 
135297 RCJJ22197 
134347 RC_N23756_s 
130365 RCJJ24134 
421642 RCJW195 
439311 RC_N26739 
124383 RC_N27098 
124387 RCJ127637 
129341 RC.N33090 
129081 RC.N35967 
102827 RC.N38959J 
124433 RC_N39069 
124441 RC.N46441 
132338 RC.N48270J 
131403 RC_N48365_s 
124466 RCJI51316 
132210 RCJi51499_s 

124483 RC.N53976 

124484 RCJI54157 

124485 RC.N54300 



100747 
124324 
452933 
132231 
129170 
133143 
132963 



AF084535 Hs.22464 epilepsy, progressive myoclonus type 2, Lafora disease (laforin) 

AI744152 Hs.283374 ESTs, Weakly similar to CA15.HUMAN COLLAGEN ALPHA 1(V) CHAIN PRECURSOR 

AA488988 Hs.293796 ESTs 

AA495824 Hs.188822 ESTs. Weakly similar to A46010 X-finked retinopathy protein [H^apiens] 

AA496369 gb:zv37d10.s1 Scares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:755827 3 1 sim'dar 

AW969025 Hs.109154 ESTs 

AU076668 Hs.334884 SEC10 (S. cerevislae)-ljke 1 

AA598440 Hs.291154 EST, Weakly similar to I38022 hypothetical protein [Ksaplens] 

AL049325 Hs.1 12493 Homo sapiens mRNA; cDNA DKFZp564D036 (from clone DKFZp564D036) 

AL044675 Hs.173081 KJAA0530 protein 

NM.014777 Hs.57730 KIAA0133 gene product 

AA765256 Hs.135191 ESTs, Weakly similar to unnamed protein product [Rsapiens] 

AA609135 Hs.293076 ESTs 

T47614 Hs.323022 ESTs, Highly similar to p60 katanin [fisaplens] 

AA609684 Hs.112748 Homo sapiens cDNA: FU21543 fis, clone COL06171 

AA609839 gb;ae82K)1.s1 Stratagene lung carcinoma 937218 Homo sapiens cDNA clone IMAGE351481 3' 

T98807 Hs.80248 RNA-binding protein gene with multiple splicing 

AA620423 Hs.1 12862 EST 

AA620747 Hs.1 12896 ESTs 

AA621364 Hs.112981 ESTs 

T89832 Hs.170278 ESTs 

AA001021 Hs.66B5 thyroid hormone receptor interactor 8 

Z83844 Hs.5790 hypothetical protein (JJ37E16.5 

AL133731 ' Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (from done DKFZp761C1712) 

NM-003478 Hs.101299 cuffin 5 

F041 12 gb:HSC2JH062 normalized infant brain cDNA Homo sapiens cDNA clone c-2jh06 3', mRNA 

AA902384 Hs.73853 bone morphogenetlc protein 2 

AI609045 Hs.321775 hypothetical protein DKFZp434D1428 

AI638418 Hs,78580 DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 

H12245 gb:ym17a12.r1 Soares infant brain 1NIB Homo sapiens cDNA done 3', mRNA sequence 

H22842 Hs.101770 EST 

H30039 Hs.107674 ESTs 

NMJH5340 Hs.2450 leucyWRNA synthetase, mitochondrial 

BE463721 Hs.97101 putative G protein-coupled receptor 

A1537162 Hs.263988 ESTs 4 

N22687 Hs.8236 ESTs 

H69899 gb.yu70c12.s1 Weizmann Olfactory Epithelium Homo sapiens cDNA clone IMAGE:239158 3' 

AI769958 Hs.108336 ESTs, WeaWy similar to ALUE_HUMAN !!!) ALU CLASS E WARNING ENTRY HI [Rsapiens] 

X54534 Hs.278994 Rhesus blood group, CcEe antigens 

AK001 507 Hs.306084 Homo sapiens clone FLB6914 PR01821 mRNA, complete cds 

X89887 Hs.172350 HIR (histone cell cyde regulation defective, S. cerevisiae) homolog A 

H80552 Hs.102249 EST 

AI351010 Hs.102267 lysosomal 

AW952124 Hs.13094 presenillns assodated rhomboid-like protein 

NM_005402 Hs.288757 v-ral simian leukemia viral oncogene homolog A (ras related) 

X04588 Hs.85844 neurotrophic tyrosine kinase, receptor, type 1 

H96552 Hs.159472 Homo sapiens cDNA: FLJ22224 fis, clone HRC01703 

AW391423 Hs.288555 Homo sapiens cDNA: RJ22425 fis, done HRC0S686 

AA662910 Hs.42635 hypothetical protein DKFZp434K2435 

AW250380 Hs.109059 - mitochondrial ribosoma! protein L1 2 

AA094538 Hs.272$08 putative transcription regulation nudear protein; KIAA1689 protein 

AA099693 Hs.34851 epsilon-tubulin 

AL1 18782 Hs.300208 ' Sec23-interacting protein p125 

AF164142 Hs.82042 solute carrier family 23 (nudeobase transporters), member 1 % 

W56119 Hs.155103 eukaryotic translation initiation factor 1 A, Y chromosome 

AF172066 Hs.106346 retlnolc add repressible protein 

BE270668 Hs.151945 mitochondrial ribosoma! protein L43 

N27098 Hs.102463 EST 

N27637 Hs.109019 ESTs 

AI193519 Hs.226396 hypothetical protein FU11126 

AI364933 Hs.168913 serine/threonine kinase 24 (Ste20, yeast homolog) 

BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (beta) 

AA280319 Hs.288840 PR01 575 protein 

AW450481 Hs.161333 ESTs 

AA353868 Hs.182982 golgin-67 

AI473114 Hs.26455 ESTs 

R10084 Hs.113319 kinesin heavy chain member 2 

NWL007203 Hs.42322 A kinase (PRKA) anchor protein 2 

AI821780 Hs.179864 ESTs 

H661 18 Hs.285520 ESTs. Weakly similar to 2109260A B cell growth fartor [Ksapiens] 

AB040933 Hs.15420 K1AA1500 protein 
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N54831 Hs.271381 ESTs, Weakly sbnflarto 138022 hypothetical protein [H.sapiens] 

N59849 Hs.13565 Sam68-fike phosphotyrosfne protein, T-STAR 

N79264 Hs.269104 ESTs 

N62375 Hs.102731 EST 

AA903424 Hs.6786 ESTs 

D54120 Hs.146409 cell division cyde 42 (GTP-binding protein, 25kD) 

AI301740 Hs.173381 dihydropyrimidlnase-like 2 

BE296313 Hs.265592 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 

N68168 gb2a11o01.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done 3\ mRNA sequence 

N68201 Hs.269124 ESTs, Weakly similar to I3B022 hypothetical protein [H.sapiens] 

N68300 gb2a12g075l Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE:292380 3', 

N68321 Hs.231500 EST 

N69575 Hs.102788 ESTs 

AL133572 ' Hs.199009 protein containing CXXC domain 2 

AI680737 Hs.289068 Homo sapiens cDNA FU11918 fis, done HEMBB1000272 

T78277 Hs.100293 O-Imked N-acety!glucosamine (GlcNAc) transferase (UDP-N-^tylglucosamin9:polypeptide-N- 

AW582962 Hs.102897 CGI47 protein 

W19407 Hs.3862 regulator of nonsense transcripts 2; DKFZP434D222 protein 

AB002316 Hs.65746 KIAA0318 protein 

AK001357 Hs.102951 Homosapiens cDNA FU10495 fis, done NT2RP2000297, moderately similar to ZINC FINGER 

AA464836 Hs.291079 ESTs, Weakly similar to T27173 hypothetical protein Y54G11AJ9 - Caenomabditis elegans 

J00077 Hs.155421 alpha-fetoprotein 

R05283 . gb:ye91c08.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:125102 3' 

T97733 Hs.185685 ESTs 

R23053 gb:yh31a05.fi Soares placenta Nb2HP Homo sapiens cDNA done 5' similar to contains L1 

RC_R23930_s AL1 37586 Hs.52763 anaphase-promoting complex subunit 7 

AI248173 Hs.191460 hypothetical protein MGC12936 

AW162919 Hs.170160 RAB2, member RAS oncogene family-like 

AL134275 Hs.6434 hypothetical protein DKFZp761F2014 

H1 1368 Hs.141055 Homo sapiens clone 23758 mRNA sequence 

AA553722 Hs.92096 ESTs, Moderately similar to A46010 X4inked retinopathy protein (Rsapiens) 

R45154 Hs.106604 ESTs 

R41179 Hs.97393 KIAA0328 protein 

AW293165 Hs.143134 ESTs 

NM.004427 Hs.165263 early development regulator 2 (homotog of polyhomeotic 2) 

AW297929 Hs.328317 EST 

W38537 Hs.280740 hypothetical protein MGC3040 

R44357 Hs.48712 hypothetical protein FU20736 

R44519 gb:yg24h04.s1 Soares infant brain 1NIB Homo sapiens cDNA done 1MAGE:33350 3\ mRNA 

R45088 Si gb.-yg38g04.s1 Soares infant brain 1NIB Homo sapiens cDNA clone IMAGE:34896 3', mRNA 

R47948 Hs.188732 ESTs 

H87832 Hs.7388 kelch (DrosophfeJ-Kke 3 

AW966158 Hs.58582 Homo sapiens cDNA FU12789 fis, clone NT2RP2001947 

R55241 Hs.101214 EST 

R59585 Hs.101255 ESTs.. . 

W07701 Hs.304177 - Homo sapiens done FLB8503 PR02286 mRNA, complete cds 

BE561430 Hs.239388 Human DNA sequence from done RP1-304B14 on chromosome 6. Contains a gene for a novel 



124861 RC_R66690 
130141 RC_R67266_s 
124879 RCJtf3588 
124892 RC_R79403 
124906 RC_RB7647 
124922 RC_R93622 

124940 RC_R99599_s 

124941 RC.R99612 
124943 RCJ02888 



R67567 

NNL004455 

R73588 

AI970003 

H75964 

R93622 

AF068846 

AI766661 

AW963279 



WARNING ENTRY [H.sapiensl 



124947 RC_T03170 
124954 RC.T10465 
132924 RC.T15418J 
133113 RC.T15597J 
132975 RCJT15652J 
133235 RCJ16898.S 
131082 RCJT26644J 
124980 RCJ40841 
124984 RCJ47566 i 
124991 RC.T50116 



T03170 
AW964237 
U55184 
BE383768 
R43504 
AW960782 
AI091121 
T40841 
BE313210 
T50116 



to similar to SP:VE22JJWBD P03756 EA22 GENE , mRNA sequence, 
129475 RC_T50145_s NWL004477 Hs.203772 FSHD region gene 1 



Hs.107110 ESTs * 

Hs. 150956 exostoses (mulfiple)-like 1 

Hs.101533 ESTs 

Hs.23756 hypothetical protein similar to swine acylneuraminate lyase 

Hs.107815 ESTs 

Hs.12163 eukaryotic translation Initiation factor 2, subunit 2 (beta, 38kD ) 

Hs.103804 heterogeneous nudear ribonudeoprotein U (scaffold attachment factor A) 

Hs.27774 ESTs, Highly simUarto AF161349 1 HSPCQ86 [Haptens] 

Hs.123373 ESTs, Weakly simBar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.100165 ESTs 

Hs.6728 K1AA1548 protein 

Hs.154145 hypothetical protein FU1 1585 

Hs.65238 95 kDa retinoblastoma protein binding protein; KIAA0661 gene product 

Hs.6181 ESTs 

Hs.6856 ash2 (absent smaD, or homeotic Drosophila, homologHIke 

Hs.246218 Homo sapiens cONA: FU21781 fis, clone HEP00223 

Hs.98681 ESTs 

Hs.223241 eukaryotic translation elongation fador 1 delta (guanine nudeotide exchange protein) 

gb:yb77c10.s1 Stratagene ovary (937217) Homo sapiens cDNA done IMAGE:77202 3' similar 
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125000 RC_T58615 

132932 R<U59940J 

129534 RCJT63595 

125008 RC.T64891 

125009 RCJ54924 
132940 RCJ64933_r 

125017 RCJ68875 
sequence. 

125018 RCJ69027 
125020 RCJT69924 
129891 RC.T70353 
134204 RCJ79780.S 
125050 RCJ79951 
125052 RC_T80174_s 
125054 RCJ80622 
125063 RC T85352 



T58615 

AW118826 

AK002126 

T91251 

T64924 

T79136 

T68875 



Hs.1 10640 

Hs.6093 

Hs/11260 

Hs.303046 
Hs.127243 



ESTs 

Homo sapiens cDNA: FU22783 fis, done KAIA1993 
hypothetical protein FLJ11264 

gb:yd60a10.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done 3\ mRNA sequence 
ESTs 

Homo sapiens mRNA for KIAA1724 protein, partial cds 

gb:yc30R}5.s1 Stratagene liver (937224) Homo sapiens cDNA done IMAGE:82209 3\ mRNA 



T69027 • Hs.57475 sex comb on midleg homolog 1 

T69981 gb:yc19d03.r1 Stratagene lung (937210) Homo sapiens cDNA done 5\ mRNA sequence 

AI084813 Hs.13197 ESTs 
AI873257 Hs.7994 hypothetical protein FU20551 
AW970209 Hs. 11 1805 ESTs 

T85104 Hs.222779 ESTs, Moderately similar to similar to NEDD-4 [H.sapiens] 
T80622 Hs.268601 ESTs, Weakly similar to envelope [H.sapiens] 

T85352 gb:yd82d01.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE: 114721 3' 

similar to contains AIu repetitive elementcontalns L1 repetitive element ;, mRNA sequence. 
1 25064 RC.J85373 T85373 . gb:yd82f07.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone IMAGE:1 14757 3' 

similar to contains AIu repetitive element contains MER3 repetitive element ;, mRNA sequence. 



125056 RCJ86284 T86284 
AIu repetitive element, mRNA sequence 

112264 ROJ89579JS AL045364 Hs.79353 

125080 RQJ90360 T90360 Hs.268620 
WARNING ENTRY [rf.sapiens] 

125097 RCJ94328J AW576389 Hs.335774 
125104 RCJ95590 T95590 



gb:yd77b07.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone 3' similar to contains 
transcription fadorDp-1 

ESTs, Highly similar to ALU6.HUMAN ALU SUBFAMILY SP SEQUENCE CONTAMINATION 



EST, Moderately similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 
gb7e40a03^1 Soares fetal fiver spleen 1NFLS Homo sapiens cDNA done 3' similar to 
gb|M10817|lGURRAA Iguana iguana 5S (rRNA );, mRNA sequence 
135107 RC T97257 f T97257 Hs.337531 ESTs, Moderately similar to I38022 hypothetical protein [H^apiens] 
1 29550 RC T97599J AA845462 Hs.124024 deltex (Drosophila) homolog 1 

125118 RC J97620 R10606 gb:yf35f1U1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA clone IMAGE128877 3' 

similar to contains Atu repetitive element, mRNA sequence. 



125120 RCJ97775 

134160 RCJ98152 

125136 RCJV31479 

125144 RCJV37999 

125150 RC.W38240 

104180 RCJV40150 

131987 RC.W45435 

125178 RCJA/58202 

125180 RCJV58344 

125182 RC W58650 

130588 RCW68736 - 

125197 RC.W69106 

133497 RC.W69111 

100562 RC.W69385ji 

125639 RC_W69399_s 
129232 RC W69459 
101495 RC.W72424 
125209 RC W72724 
125212 RC W72834 
129132 RC W73955 
125223 RC.W74701 
WARNING ENTRY [Ksapiens] 
125225 RC_W76540 
125228 RC W79397 
132393 RC W85888 
125238 RC W86038 
125247 RC.W86881 
129298 RC W87804 
125263 RC W88942 
125266 RC_W90022 
PRECURSOR [H.sapiens] 
131321 RCW92272 
131601 RC_W92764_s 
131677 RC.W93040 
120837 RCJW93092 

125277 RCJV93227 

125278 RC.W93523 
125280 RC.W93659 
131856 RC_W94003_s 
131844 RC_W94401_s 
125284 RCJV94688 
313447 RCJV94787_s 
130799 RC_Z38294_s 
125289 RC.238311 
128874 RC^Z38465.s 



T97775 Hs.100717 EST 

T98152 Hs.79432 fibrillin 2 (congenital contractural arachnodactyly) 

AW962364 Hs.129051 ESTs 

AB037742 Hs.24336 KIAA1321 protein 

W38240 Empirically selected from AFFX single probeset 

AA247778 Hs.119155 Homo sapiens mRNA full tength insert cDNA done EUROIMAGE 814975 

AW453069 Hs.3657 activity-dependent neuroprotective protein 

W93127 Hs.31845 ESTs 

W56469 Hs.103120 ESTs 

AA451755 Hs.263560 ESTs 

AL030996 Hs.16411 hypothetical protein LOC57187 

AF086270 Hs.278554 heterochromatin-Dke protein 1 

BE617303 Hs.74266 hypothetical protein MGC4251 

NWL006185 Hs.301512 nudear mitotic apparatus protein 1 

Z97630 Hs.226117 H1 histone family, member 0 

R98881 Hs.109655 sex comb on midleg (DrosophilaHike 1 

W72424 Hs.1 12405 S100 calcium-binding protein A9 (calgranulin B) 

W72724 Hs.103174 ESTs, Weakly similar to TSP2_HUMAN THROMBOSPONDIN 2 PRECURSOR [H.sapiens] 

AA746225 Hs.103173 ESTs 

BE383436 Hs. 108847 hypothetical protein MGC2749 

AI916269 Hs.109057 ESTs, Weakly similar to ALU5.HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 

W74169 Hs.16492 DKFZP564G2022 protein 

AA033982 Hs.1 10059 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 

AL135094 Hs.47334 hypothetical protein FU14495 

N99713. Hs.109514 ESTs . 

AA694191 Hs.163914 ESTs 

AI051967 Hs.110122 ESTs 

AA098878 gb:zn45g10.r1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA done 5', mRNA sequence 

W90Q22 Hs.186809 ESTs, Highly similar to LCT2_HUMAN LEUKOCYTE CELL-DERIVED CHEMOTAXIN 2 

U91543 Hs.25601 chromodomain helicase DNA binding protein 3 

NMJJ071 15 Hs.29352 tumor necrosis factor, alpha-induced protein 6 

H05317 Hs.283549 ESTs 

BE149656 Hs.306621 Homo saplenscDNA FU11963 fis, done HEMBB1001051 

W93227 Hs.103245 EST 

AI218439 Hs.129998 enhancer of polycomb 1 

AI123705 Hs.106932 ESTs 

W93949 Hs.33245 ESTs s 

AI419294 Hs.324342 ESTs 

NMJ02666 Hs.103253 perilipin 

AW016321 Hs.62306 destrin (actin depolymerizlng factor) 

AB028945 Hs.12696 cortacQn SH3 domain-binding protein 

T34530 Hs.4210 Homo sapiens CDNAFU13069 fis, clone NT2RP3001752 

H06245 Hs.106801 ESTs, Weakly similar to PC4259 ferritin assodated protein [H.sapiens] 
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130966 RC Z38525 s AW971018 Hs.21659 

128875 RC Z3B538J AB040923 Hs.106808 

133200 RC.Z38551 S AB037715 Hs.183639 

130158 RC Z387B3_S AB032947 Hs.151301 

5 125295 RC_Z39113 AB022317 Hs.25887 
domain, {semaphorm)4F 

125298 RC.Z39255J AW972542 Hs.289008 

125300 RC_Z39591 Z39591 Hs.101376 

323122 RC_Z39783_S BE622770 Hs.264915 

10 311463 RCZ39920 R55344 Hs.22142 

130882 RC_Z40166J AA497044 Hs.20B87 

128888 RC 240388.S AI760853 Hs.241558 

125310 RC Z40646 R59161 Hs.124953 

125315 RC.241697 R38110 Hs.106296 

15 125317 RC.Z99349 Z99348 Hs.1 12461 

135095 RC Z99394_s AA081258 Hs.132390 

104786 RC_AA027168 AA027167 Hs.10031 

132837 D58024_s AA370362 Hs.57958 

120456 RC.AA251113 AA488750 Hs.88414 

20 132459 RC.M347573 AL120071 Hs.48998 

101545 M31210 BE246154 Hs.154210 

133505 C01527 AI630124 Hs.324504 

132360 RC N62948 s AW893660 Hs.46440 

132738 RCJV42674 AK000738 Hs.264636 

25 119586 RC_W43000_S AF088033 Hs.159225 

129914 RC_N31750„s NMJH2421 Hs.13321 

130839 AF009301 AB011169 Hs.20141 

132813 L37347 BE313625 Hs.57435 

134342 M99564 NMJJ00275 Hs.82027 

30 131878 RC_AA430673 AA083764 Hs.6101 

105426 RCJ\A251297 W20027 Hs.23439 

132968 RCJU620722 AF234532 Hs.61638 

132173 RCW46577_s X89426 Hs.41716 

113932 RCJV81237 AA256444 Hs.126485 

35 114452 RC AA020825 AI369275 Hs.243010 
PROTEIN TC10 

115243 RC_AA278766 AA806600 Hs.116665 

134403 RC H93708 s AA334551 Hs.82767 

129647 RC N49394 AB018259 Hs.118140 

40 111428 RC_H56559_s AL031428 Hs.174174 

115967 RC_AA446887 AI745379 Hs.42911 

120726 RC_AA293656 AA293655 Hs.97293 

114995 RCJW251152 AA769266 Hs.193657 

303876 RCJVA233334_s U64820 Hs.66521 

45 . dominant, ataxin' 3) 

311463 RC.Z39920 R55344 Hs.22142 

- 120302 RCJW192173 AA837098 Hs.269933 

133071 RC.AA455044 BE384932 Hs.64313 

121032 RC_AA3985Q4 AA393037 Hs.161798 

50 129829 U41813 AF010258 Hs.127428 

120245 RC AA166965 AW959615 Hs.111045 

' 120985 RCIAA398222 AI219898 Hs.97592 

114184 RC Z39095 R56434 Hs.21062 

447503 RC AA284744J AA1 15496 Hs.336898 

55 cds 

132837 RC_AA42B201 AA370362 Hs.57958 

121034 RCJ\A398507 AL389951 Hs.271623 

119718 RC W69216 W69216 Hs.92848 

120455 RCZaA251083 AA251720 Hs.104347 

60 125280 RC_W93659 AI123705 Hs.106932 

132155 RCJW227903 AK001607 Hs.41127 

120609 RCJ\A2B3902 AW978721 Hs.266076 

121278 RCJ\A401631 AA037121 Hs.98518 

109023 RCJW57293 AA157293 Hs72168 

65 129815 RCJD60208J BE565817 Hs.26498 

108061 RCJW043979 AA043979 Hs.62651 

113287 RC T66847 T66847 Hs.194040 

114082 RC Z38239 AK001612 Hs.26962 

116334 RCJW491457 AL038450 Hs48948 

70 131486 RC_Z4C071js F06972 Hs.27372 

107860 RCJWJ24961 AA024961 Hs.50730 

131263 RCJW443826 AU077002 Hs.24950 

132207 RCJ\A443294 BE206939 Hs42287 

129183 RCJW55743 BE561824 Hs.273369 

75 408431 RCJ23708 AI338631 Hs43265 
120575 RCJ\A280934 AW978022 Hs.238911 



ESTs 

kelch (Drosoph!la)-Gke 1 
hypothetical protein FU10210 
Ca2+-dependent activator protein for secretion 

sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic 

Homo sapiens cDNA: FU21814 fis, done HEP01068 
EST 

Homo sapiens cDNA FU12908 fis, clone NT2RP2004399 

cytochrome b5 reductase b5R.2 

hypothetical protein FU10392 

ariadne Prosophila) homolog 2 

ESTs 

ESTs 

ESTs, Weakly similar to I38022 hypothetical protein [Ksapiens] 
zinc finger protein 36 (KOX 1 8) 
KIM0955 protein 
EGF-TM74atrophiiin-reIated protein 

BTB and CNC homology 1 , basic leucine zipper transcription factor 2 

fibronedin leudne rich transmembrane protein 2 

endothelial differentiation, sphingoiipid G-protein-coupled receptor, 1 

Homo sapiens mRNA; cDNA DKFZp586J0720 (from done DKFZp586J0720) 

solute carrier family 21 (organic anion transporter), member 3 

hypothetical protein FLJ20731 

ESTs 

rearranged L-myc fusion sequence 
similar to S. cerevisiae SSM4 

solute earner family 1 1 (proton-coupled divalent metal ion transporters), member 2 

oculocutaneous albinism II (pink-eye dilution (murine) homolog) 

hypothetical protein MGC3178 

ESTs 

myosin X 

endothelial cell-spedfic molecule 1 
hypothetical protein FU12604; KIAA1692 protein 

Homo sapiens cDNA FU14445 fis, done HEMBB1001294, highly similar to GTP-BINDING 

KIAA1842 protein 

sperm specific antigen 2 

KIAA0716geneprodud 

WAA0601 protein 

ESTs 

ESTs 

ESTs 

Machado-Joseph disease (spinocerebellar ataxia 3, olivopontocerebellar ataxia 3, autosomal 

cytochrome b5 reductase D5R.2 
ESTs 

ESTs, Weakly similar to AF257182 1 G-protein-coupled receptor 48 [H.sapiens] 
ESTs 

homeoboxA9 
ESTs 
ESTs 
ESTs 

Homo sapiens, Similar to RIKEN cDNA 1810038N03 gene, done MGCS890, mRNA, complete 

EGF-TM7-latrophflln-related protein 

nudeoporin 50kD 

ESTs 

ESTs, Weakly similar to ALUC.HUMAN HI! ALU CLASS C WARNING ENTRY ll! [H.sapiens] 
ESTs 

hypothetical protein FU13220 

ESTs, Weakly similar to A46010 X-linked retinopathy protein [Rsapiens] 

Homo sapiens cDNA FU11490 fis, done HEMBA1001918 

ESTs 

hypothetical protein FU21657 
EST 

ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 
Homo sapiens cDNA FU10750 fis, clone NT2RP3001929 
ESTs 

BMX non-receptor tyrosine kinase 
ESTs 

regulator of G-protein signalling 5 
E2F transcription factor 6 

uncharacterized hematopoietic stem/progenitor cells protein MDS027 
Homo sapiens cDNA: FU22536 fis, done HRC13155 
hypothetical protein DKFZp762E151 1; KIAA1816 protein 
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132121 RCJ\A443284_s NMJ04529 



N39074 

AI718295 

Y07759 

AA649530 

BE545072 

AA196300 

M29551 

AB026436 

AW172431 

AL137554 

AF010258 

AA447709 



117657 RC.N39074 
134922 RCJVQ4507_S 
118523 RC_R41828_s 
116845 RC.H64973 
115291 RC.AA279943 
120326 RCJW1963Q0 
130174 M29550 
129131 RC„AA436489 
129868 RC.AA287032 
118661 RC_N70777 
129829 RC..AA496921 
115985 RC_M447709 
'134637 RCJ\A369856_s U87309 
132714 RCJU252598 W39388 
129771 RCJ173237 
123360 RC.AA504784 
132902 RC_AA490969 
113716 RC_T97750 
113825 RCJM8860 
130367 RC_Z38501 
120541 RC.AA278298 
116727 RC_F13684 
118219 RCJI62231 
119767 RC.W72562 
128917 RCJ\A481252 
451553 RCLAA020928 
132716 RCj\A251288 
118525 RCJJ67861 
.114618 RC_AA084162 
119743 RCJV70242 
108154 R(LAA425151_s NMJ05754 
122798 RCJ^A460324 AW366286 
133746 U44378 
119822 RCJv74471 
122186 RCJVA435842 
114941 RCJM243017 
118053 RC_N53367 
123234 RC_AA490227 
129280 M63154 
118995 RCJJ94591 ' 
116750 RC.H05960 
129026 M98833 
105127 RC_AA158132 
114513 RCLAA044825 
411856 RC_T35697 
132036 W01568 
130091 RC.W88999 
sequence 
414108 U09564 
119881 RC.W81456 
117770 RC_N47953 
119850 RCJV80447 
115439 RC_AA284561 
123107 RCJWTC6071 
406698 M24364 
121231 RCJM00780 
132074 AB002366 
413670 AB000115 
125277 RC.W93227 
114056 RCJVU86324 
121153 RC.M399640 
121609 RCLAA416867 
120661 RC^AA287556 
120850 RC^AA349647 
124947 RCJO3170 
130529 RC.AA280886 
repetitive element, mRNA sequence 
117683 RCJJ40180 N40180 
IMAGE:276387 3' similar to contains L1 .t1 
120745 RCJW302809 AA302809 
120936 RCJW385934 
112597 RC_R78376 
120183 RC^Z40174 
120644 RCJVA287038 



AL096748 

AA532718 

AI936442 

AA001356 

AW014486 

AL135301 

W07318 

R76472 

AA862391 

W72562 

AI365215 

AA018454 

BE379595 

N67861 

AW979261 

AA947552 



AW410035 

AF086409 

AA398811 

AA236512 

N53391 

NMJ01938 

M63154 

N94591 

AA760689 

AL120297 

AA045648 

AA044873 



AL157433 



A1267592 

W81486 

AW957372 

AI247568 

A1567972 

AA225048 

X03068 

AA814948 

AA478486 

AB000115 

W93227 

AA188175 

AA399640 

AA416867 

AA287556 

AA349647 

T03170 

AA178953 



AA385934 
R78376 
AW082866 
AIB69129 



Hs.404 ' myeloid/lymphoid or mixed-lineage leukemia (trithorax (DrosophOa) homolog); translocated to, 3 

Hs.44933 ESTs 

Hs.91161 prefoldin4 

Hs.170157 myosin VA (heavy polypeptide 12, myoxin) 

gb^s44f05.s1 NCLCGAPJVtvl Homo sapiens cDNA clone, mRNA sequence 

Hs.122579 hypothetical protein FU10461 

Hs.21 145 hypothetical protein RG083MQ5.2 

Hs.151531 protein phosphatase 3 (formerly 28), catalytic subunit, beta isoform (caldneurin A beta) 

Hs.177534 dual specificity phosphatase 10 

Hs.13012 ESTs 

Hs.49927 protein kinase NYD-SP15 

Hs.127428 homeoboxA9 • 

Hs.268115 ESTs, Weakly similar to T08599 probable transcription factor CA 150 [Rsapiens] 

Hs.180941 vacuolar protein sorting 41 (yeast homolog) 

Hs.55336 Homo sapiens, done MGC:17421, mRNA, complete cds 

Hs.102708 DKFZP434A043 protein 

Hs.178604 ESTs 

Hs.59838 hypothetical protein FU10808 

Hs.18159 ESTs 

Hs.22509 ESTs 

Hs.8768 hypothetical protein FLJ10849 

Hs.240 rvtphase phosphoprotein 1 

Hs.65646 ESTs 

Hs.48494 ESTs, Moderately similar to A46010 X-Ilnked retinopathy protein [H^apiens] 

Hs.58119 ESTs 

Hs.206097 oncogene TC21 

Hs.269211 ESTs 

Hs.283738 casein kinase 1 , alpha 1 

Hs.49390 ESTs 

Hs.291993 ESTs 

Hs.58086 ESTs 

Hs.220689 Ras-GTPase-acthrating protein SH3-domain-binding protein 

Hs.145696 splicing factor (CC1.3) 

Hs.75862 MAD (mothers against decapentaptegic Drosophila) homolog 4 

Hs.301327 ESTs 

Hs.104673 ESTs 

Hs.87331 ESTs 

Hs.47629 ESTs 

Hs.16697 down-regulator of transcription I.TBP-bindbig (negative cofactor 2) 

Hs.1 10014 gastric Intrinsic factor (vitamin B synthesis) 

Hs.323056 ESTs 

Hs.92418 ESTs 

Hs.108043 Friend leukemia virus integration 1 

Hs.301957 nudix (nucleoside diphosphate linked moiety X)-type motif 5 

Hs.103446 ESTs 

Hs.4190 Homo sapiens cDNA: FLJ23269 fls, clone COL09533 

Hs.37706 hypothetical protein DKFZp434E2220 

gb:zh70hQ3.s1 SoaresJetalJfoer_spleen_1NFLS_S1 Homo sapiens cDNA clone 3', mRNA 

Hs.75761 SFRS protein kinase! 

Hs.58648 ESTs 

Hs.46791 ESTs, Weakly simlar to 138022 hypothetical protein [H^apiens] 

Hs.58452 ESTs 

Hs.193090 ESTs, Highly similar to AF1 61 437 1 HSPC319 [Rsapiens] 

Hs.104207 ESTs 

Hs.73931 major histocompatibility complex, class II, DQ beta 1 

Hs.96343 ESTs, Weakly similar to ALUCJHUMAN lili ALU CLASS C WARNING ENTRY ill [Rsapiens] 

Hs.3852 KIAA0368 protein 

Hs.75470 hypothetical protein, expressed in osteoblast 

Hs.103245 EST 

Hs.82505 K1AA1254 protein 

Hs.97694 ESTs 

Hs.98185 EST 

Hs.263412 ESTs, Weakly similar to ALUBJWMAN fill ALU CLASS B WARNING ENTRY III [Ksapiens] 

Hs.96927 Homo sapiens cDNA FU12573 fls, done m?RM4000979 

Hs.100165 ESTs 

gb:zp39e03.s1 Stratagene muscle 937209 Homo sapiens cDNA done 3" similar to contains Alu 



gb:yy44d02.s1 SoaresjTUi!tiple_sclerosls_2NbHMSP Homo sapiens cDNA clone 
L1 repetitive element ;, mRNA sequence. 

gb:EST10426 Adipose tissue, white I Homo sapiens cDNA 3 1 end, mRNA sequence. 

Hs.97184 EST,! 

Hs.29733 EST 
Hs.65882 ESTs 
Hs.96616 ESTs 
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119023 RCJJ98488 N9848B 
IMAGE:310129 3', mRNA sequence. 

107582 RCJ\A002147 AA002147 

118249 RC.N62580 N62580 

115022 RCJ\A252029 AA252029 

117710 RCJI45198 N45198 

115341 RCJVA281452 AA2B1452 

118896 RC.N9Q680 N46213 

121121 RCJVA399371 AA399371 

118329 RCJI63520 N83520 
3', mRNA sequence. 

119496 RCJV35416 W35416 

118111 RCJs!55493 N55493 
mRNA sequence. 
119062 RCJR16698 
116710 RC_F10577J 
119261 RCJT15956 
122723 RC AA457380 



AW444881 
F10577 
T15956 
AA457380 



gb:zb82h01.s1 Soaies_senescenLfibrob!asts_NbHSF Homo sapiens cDNA clone 
Hs.59952 EST 

Hs.322925 EST, Weakly similar to putative p1 50 [H.sapiens] 
Hs.87935 ESTs 

Hs.47248 ESTs, Highly similar to similar to Cdc14B1 phosphatase [H.sapiens] 
Hs.88840 EST, Weakly similar to granule cell marker protein [M.musculus] 
Hs.54642 methionine adenosytlransferase II, beta 
Hs.189095 similar to SALL1 {sal (DrosophflaHike 

gb:yy62f01.s1 Soaresjnultiple_sderosis_2NbHMSP Homo sapiens cDNA done IMAGE:278137 

Hs.156861 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 

gb:yv50c02.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 1MAGE246146 3', 

ESTs 



Hs.77829 
Hs.306088 
Hs,65289 



v-crk avian sarcoma virus CT10 oncogene homolog 
EST 

gb:aa86b10.s1 Stratagene fetal retina 937202 Homo sapiens cDNA done IMAGE:838171 3' 
similar to contains L1 .b3 L1 repetitive element ;, mRNA sequence. 

1 17732 RCJM6452 N46452 gb:yy76h09.s1 Soares_multiple_sderosis_2NbHMSP Homo sapiens cONA done 

IMAGE:279521 3' similar to contains Llt2 L1 repetitive element mRNA sequence. 

104787 RC..AA027317 AA027317 gb:ze97d1 1.s1 Soares_fetaLhearLNbHH19W Homo sapiens cDNA done IMAGE:366933 3 1 

similar to contains Alu repetitive element,, mRNA sequence. 



100071 A28102 
115819 RCJ\A426573 
130882 RCLZ40166J 
125225 RCJV76540 
108339 RC_AA070801 



A28102 
AA486620 
AA497044 
W74169 
AW151340 



WARNING ENTRY [Rsapiens] 

100338 D63483 D86864 

121636 RCJW417027 AA379203 

103875 RC.M418387 T26379 

118716 RC N73460 A1658908 

119763 RC W72450 R54146 

121917 RCJW428218 AA406397 

132806 M91488 A1699432 

130949 Y10659 AV656840 

108806 RCJVA129933 AF070578 

133276 RCJ\A490478 AW978439 

134760 RCJH16758 NM.000121 

132867 AA121287 AF226667 

132051 AA091284 AA393968 

114208 RC Z39301 AL049466 

104094 AA418187 AA418187 

128718 AA426361 NIVL002959 

302032 RCJI20407 NM_001992 

115501 RC_AA291553 AA291553 

101997 U01160 AU076536 

103708 AA037206 AA430591 

101899 S59184 S59184 

115839 RCJ\A429038 BE300266 

409459 D50678 D86407 

103563 Z22534 L02911 

123233 RC_AA490225 AW974175 
[H^apiens] ^ 

121305 RC_AA402468 AA402468 

114798 RC.AA159181 AA1 59181 

133145 RCJW196549 H94227 
131567 RCJ\A291015_S AF015592 

112300 RC.R54554 H24334 

129507 RC_AA192099 AJ236885 

121033 RC.AA398505 AA398505 

121151 RC.M399636 AA399636 

121402 RCJ\A4Q6063 AA406063 

123203 RO J AA489671 AA352335 

132271 RCJ\A236466 AB030034 

125197 RCJV69106 AF086270 

114935 RC^AA242B09 H23329 
WARNING ENTRY [Rsapiens] 

125279 RC_W93640 AW401809 

108778 RC J AA128548 AF133123 

108087 RC^A045709 AA045708 

132466 RC_N66B10_S AI597655 

133328 R36553 AW452738 

124057 RC.F13804 AA902384 

124800 RCJW5115 AW864086 



Human GABAa receptor atpha-3 subunit 

Hs.41135 endomucin-2 

Hs.20887 hypothetical protein FU10392 

Hs.16492 DKFZP564G2022 protein 

Hs.51615 ESTs, Weakly similarto ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

Hs.57735 acetyl LDL receptor; SREC 

Hs.306654 Homo sapiens cDNA RJ13574 fis, done PLACE1008625 

Hs.48802 Homo sapiens done 23632 mRNA sequence 

Hs.1 18722 fucosyltransferase 8 (alpha (1 ,6) fucosyltransferase) 

Hs.10450 Homo sapiens cDNA: FU22063 fis, done HEP10326 

Hs.98038 ESTs 

Hs.278519 hypothetical protein FU10099 

Hs.2851 15 interleukin 13 receptor, alpha 1 

Hs.71168 Homo sapiens done 24674 mRNA sequence 

Hs.69504 ESTs 

Hs.89548 erythropoietin receptor 

Hs.58553 CTP synthase II 

Hs.180145 HSPC030 protein 

Hs.7859 ESTs 

Hs.330515 ESTs 

Hs.281705 sortilinl 

Hs.128087 coagulation factor II (thrombin) receptor 

Hs.190086 ESTs 

Hs.50984 sarcoma amplified sequence 

Hs.72071 hypothetical protein FU20038 

.Hs.79350 RYK receptor-like tyrosine kinase 

Hs.28935 transdudn-iike enhancer of split 1 , homolog of Drosophila E(sp1 ) 

Hs.54481 low density lipoprotein receptor-related protein 8, apolipoprotein e receptor 

Hs.150402 Activin A receptor, type I (ACVR1) (ALK-2) 

Hs.188751 ESTs, Weakly similar to MAPB^HUMAN MICROTUBULE-ASSOCIATED PROTEIN 1B 

Hs.291557 ESTs 

Hs.54900 serologically defined colon cancer antigen 1 

Hs.6592 Homo sapiens, clone IMAG&2961368, mRNA, partial cds 

Hs.28853 CDC7 (cell division cyde 7, S. cerevisiae, homo!og)-like 1 

Hs.26125 ESTs 

Hs.1 12180 zinc finger protein 148 (pHZ-52) 

Hs.97360 ESTs 

Hs.143629 ESTs 

Hs.98003 ESTs 

Hs.65641 hypothetical protein FU20073 

Hs.115175 sterile-alpha motif and leudne zipper containing kinase AZK 

Hs.278554 heterochromatin-like protein 1 

Hs.290880 ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.4779 K1AA1150 protein 

Hs.90847 general transcription factor IllC, polypeptide 3 (102kD) 

Hs.40545 ESTs 

Hs.49265 ESTs 

Hs.265327 hypothetical protein DKFZp761l141 

Hs.73853 bone morphogenetic protein 2 

Hs.138617 thyroid hormone receptor Interactor 12 
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M398482 

AA827798 

AU076845 

AM23132 

AA782536 

AK000061 

H75964 

AA199883 

AW137422 

T85104 

AW508930 

AA648278 



T90313 

AF028706 

AF151852 

AA135332 

AA3143B9 

AF084104 

AA330310 

AF031463 

AA292105 

R41179 

AI379720 

AA524470 

AW207152 

NM_016940 

BE619092 

AA026815 

AA356792 

AW961026 



121029 RCJ\A398482 

120663 RC.AA287627 

102133 U15173 

108246 RCJ\A062855 

125226 RC W78134 

120260 RCJWI71739 

124906 RC_R87647 

109406 RC_AA226877 

109271 RC.AA195668 

125052 RC_T80174_S 

109101 RCJVA167708 

115241 RCJ\A278723 

117163 RC.H97909 

113530 RC.T90313 

120375 RCJVA227260 

129435 M314256 

114864 RCJVA235256 

103988 M314389 

'131006 RCJVA242763 

106781 RCAA478474 

106141 RCLAA424558 
. 116213 RC.AA476738 
. 135266 AB002326 

135058 RCJVA430152 

119908 RCLW85844 

103695 AA018758 

103978 AA307443 

109485 RCJVA233472 

129574 AA458603 

115347 RCLAA281528 

120765 RC.AA338735 

WARNING ENTRY [H.saplens] 

121059 RCJ\A398628 AA393283 

131887 AA046548 ■ 

member 1 

112064 RC.R43812 
115606 RC_AA400465 
131750 RC_H94855_s 
102123 U14518 
129847 RCJV46767 
133809. RC.AA235275 
132210 RC.N51499JS 
122356 RCJVA443794 
114958 RC.AA243708 
103951 AA287840 

134703 RCJVA280704 
128727 AA287864 
105743 RCJ\A293300_s BE246502 

' domain, (semaphorin) 4B 
103744. AA076003. 
sequence 
114348 N80402 
114009 RCJW900&7 

134704 RCJ\A280849 
128629 AA399187 
104410 H65925 
110200 RC_H21075 
124483 RC.N53976 
101391 M14648 
109657 RC.F04826 
117140 RC_H96813 
132937 RC.M233706J AW952912 
129799 R36410 AW967473 
105077 RCJ\A142919 
100850 RC_N58561_s 
131043 RC.JAA490925 
118417 RCJI66048J 
129254 RCJVA243695 
119149 RC.R58910 
133996 AA091367 
110223 RC.H23747 
117626 RCJ136090 
135286 RC_M424469_s AW023482 
122967 RCJW478521 AA806187 
131236 AA282640 AF043117 
128568 AA463380 H12912 



Hs.97641 

Hs.1 05089 

Hs.1 55596 

Hs.146343 

Hs.122647 

Hs.101590 

Hs.107815 

Hs.67624 

Hs.86022 

Hs.222779 

Hs.52184 

Hs.193859 

Hs.42344 

Hs.16732 

Hs.111227 

Hs.1 11449 

Hs.71608 

Hs.42500 

Hs.22116 

Hs.24181 

Hs.9302 

Hs.326740 

Hs.97393 

Hs.93814 

Hs.58753 

Hs.186600 

Hs.34136 

Hs.28465 

Hs.1 1463 

Hs.334824 

Hs.96752 



W17064 Hs.332848 



AL049390 

AI025829 

NWL004349 

NNL001809 

N64025 

AV649326 

NM-007203 

AA443794 

N20912 

AL353944 

AF1 17065 

AI223335 



AA079267 

AL050321 
AI248544 
AA837124 
AL096748 
AI807519 
H21075 
AI821780 
NWL002210 
R60900 
H96813 



W55946 

AAB36472 

AF084535 

AF080229 

AA252468 

BE304701 

AA380267 

H19836 

AK001757 



Hs.22689 

Hs.86320 

Hs.31551 

Hs.1594 

Hs.296178 

Hs.76359 

Hs.42322 

Hs.98390 

Hs.42369 

Hs.50115 

Hs.88764 

Hs.50651 



EST 
ESTs 

BCL2/adenovfrus E1B 19kD-interacting protein 2 
ESTs 

N-myristoyrtransferase2 

hypothetical protein 

ESTs 

ESTs 

ESTs 

ESTs, Moderately similar to similar to NEDD-4 rjisaplensj 

hypothetical protein FU20618 

ESTs 

ESTs 

ESTs 

Zic family member 3 (odd-paired Drosophiia homolog, heterotaxy 1 ) 

CGI-94 protein 

ESTs 

ADP-ribosylation factor-like 5 

CDC14 (cell division cycle 14, S. cerevisiae) homolog B 

ESTs 

phosducin-like 

hypothetical protein MGC10947 

KIAA0328 protein 

hypothetical protein 

ESTs 

ESTs 

chromosome 21 open reading frame 6 

Homo sapiens cDNA: FU21869 fis, clone HEP02442 

UMP-CMP kinase 

hypothetical protein FU14825 4t ..«^ ( 
ESTs, Weakly similar to ALU8JHUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 

gb:zt74e03.r1 Soares_testis_NHT Homo sapiens cDNA clone 5', mRNA sequence 
SWI/SNF related, matrix associated, acttn dependent regulator of chromatin, subfamily e, 

Homo sapiens mRNA; cDNA DKFZp58601318 (from clone DKFZp58601318) 
ESTs 

core-binding factor, runt domain, alpha subunit 2; translocated to, 1; cydin D-related 
centromere protein A (17kD) 
hypothetical protein FLJ22637 



Hs.301532 

Hs.103000 

Hs.88780 

Hs.102708 

Hs.104520 

Hs.31802 

Hs.179864 

Hs.295726 

Hs.26814 

Hs.42241 

Hs.300383 

Hs.239114 

Hs.234863 

Hs.297939 

Hs.22464 

Hs.1098 

Hs.65732 

Hs.78277 

Hs.31697 

Hs.281348 

Hs.97849 

Hs.289101 

Hs.24594 

Hs.274691 



A kinase (PRKA) anchor protein 2 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp761 J1112 (from done DKFZp761J1 112) 
male-specific lethai-3 (DrosophllaHike 1 
Janus kinase 1 (a protein tyrosine kinase) 

sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic 

gb:zm97e10.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA clone 3\ mRNA 

CRP2 binding protein 
KIAA0831 protein 
ESTs 

OKFZP434A043 protein 

Homo sapiens cDNA FLJ13694 fis, clone PLACE2000115 

ESTs, Highly similar to A59266 unconventional myosin-15 [H.sapiens] 

ESTs 

integrin, alpha V (vitronectin receptor, alpha polypeptide, antigen CD51) 

ESTs 

ESTs 

hypothetical protein MGC3032 

mannosidase, alpha, class 1 A, member 2 

Homo sapiens cDNA FU12082 fis, clone HEMBB1002492 

cathepsin B 

epilepsy, progressive myoclonus type 2, Lafora disease (lafortn) 

gb:Human endogenous retrovirus K clone 10.1 polymerase mRNA, partial cds 

DKFZp434J1813 protein 

ESTs 

DKFZP434F2021 protein 
ESTs 

hypothetical protein RJ10B95 
ESTs 

glucose regulated protein, 58kO 

ubiquitination factor E4B (homologous to yeast UFD2) 

adenylate kinase 3 
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112888 
115192 
116688 
122254 
128981 
131042 
103704 
121341 
106593 
115195 
115425 
117258 
120209 



RCJQ3872 

RCJW261920 

RC_N71484 

RCJM36837 

AA135452 

RC.R42457 

AA028171 

AA233107 

RC AA456826 

RCJIA262156 

RCJVA284071 

RCJJ21299 

RC.Z40892 



134082 L16991 

104774 RC..AA026066 

115625 RC.M40163Q 

104469 N28707 

107401 W20054 

111686 RC R21510 

115300 RCJA2B0026 

115378 RC_AA282292 

132224 RC H97819 

113791 M95767 

129144 AA004987 

104448 L44574 

132084 RC_T2698Ls 

111831 RC_R36083 

114765 RCJW252163 

115029 RCJW252219 

100457 H81492 

104536 R24011 . 
PROTEIN 91 

116167 RC.M461562 

103889 AA236771 

131978 RC H48459.S 

118843 RC_N60181 

120837 RC W93092 

133647 D21852 

129521 U41815 

103746 AA081876 
sequence 

132019 RCJW134965J 

132310 RC_AA284107 

117367 RC_N24954 

103743 AA075998 



AW195317 Hs.107716 hypothetical protein FU22344 

AA741024 Hs.88378 ESTs 

AK000708 Hs.169764 hypothetical protein FU20701 

AA436837 gb:zv57g07.s1 SoaresJestis_NHT Homo sapiens cDNA clone 3', mRNA sequence 

AA927177 Hs.86041 COG triplet repeat binding protein 1 

AI826288 Hs.171637 hypothetical protein MGC2628 

AA028171 Hs.151258 hypothetical protein FLJ21062 

AF035528 Hs.1 53863 MAD (mothers against decapentapleglc, Drosophila) homolog 6 

AW296451 Hs.24605 ESTs 

AW968619 Hs.155849 ESTs 

AA611895 Hs.180680 ESTs, Weakly similar to 154374 gene NF2 protein [Ksapiens] 

AF086041 Hs.42975 ESTs 

F02951 gb:HSC1HB082 normalized infant brain cDNA Homo sapiens cDNA clone c-1hb08 3', mRNA 

L16991 Hs.79006 deoxythymidylate kinase (thymidylate kinase) 

AW959755 Hs288896 Homo sapiens cDNA FU12977 lis, done NT2RP20Q6261 

AA059459 Hs.62592 ESTs 

N28707 Hs.154304 Homo sapiens chromosome 19, BAC 282485 (CIT-B-344H19) 

N91453 Hs.102987 ESTs 

R22039 Hs.23217 ESTs 

AA280095 Hs.8B689 ESTs 

AA282292 Hs.279841 hypothetical protein FU1 0335 

N41549 Hs.285410 ESTs 

AI269096 Hs.135578 chltobiase, aVN-aoetyl- 

AL137275 Hs.20137 hypothetical protein DKFZp434P0116 

NM.007331 Hs.1 10457 Wolf-Hirschhom syndrome candidate 1 

NNL002267 Hs.3886 karyopherin alpha 3 (importin alpha 4) 

R36095 Hs268695 ESTs 

AA463550 Hs.337532 ESTs, Weakly similar to A47582 B-cel) growth factor precursor [H^apiens] 

AL137939 Hs.40096 ESTs 

BE246400 Hs.285176 acetyVCoenzyme A transporter 

R24024 Hs.158101 Homo sapiens cDNA FU14673 fis, done NT2RP2003714, moderately similar to ZINC FINGER 

AI091731 Hs.87293 hypothetical protein FU20045 

R85350 Hs.101368 ESTs 

AA355925 Hs,36232 K1AA0186 gene produd 

N80181 Hs.221498 ESTs 

BE149656 Hs.306621 Homo sapiens cDNA FU 11963 fis, clone HEMBB1001051 

NM_015361 Hs.268053 K1AA0029 protein 

AF071076 Hs.112255 nudeoporin 98kD 

AA075000 gb:zm83c07.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done 3', mRNA 

H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA, partial cds 

AA173223 Hs.289044 Homo sapiens cDNAFU 12048 fis, clone HEMBB1001990 

AI041793 Hs.42502 ESTs 

AA075998 gb:zm89b09.r1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done 5' similar to 



gb:M15887 ACYL-GOA-BINDING PROTEIN (HUMAN);, mRNA sequence 



103761 AA0B5138 AA765163 
BINDING PROTEIN (HUMAN);, mRNA sequence 



130237 L3906O 

128752 RC_N72879 

135162 AA045930 

131386 AA096412 

129021 RCJW599244 

424274 AA293634 

129913 H06583 

131888 U79298 
23803 mRNA 

118612 RC.N69466 

322026 AA203138 

110892 RC.N38882 

111429 RC.R01245 

113334 RC.T76962 

104091 AA417310 

105246 RCJW226879 



AA913909 
AA504428 
AI187925 
BE219898 
AL044675 
W73933 
NM.001310 
AW294659 

AB037788 

AW024973 

AL035301 

AI038052 

AW974666 

BE465093 

AA226B79 



Hs.153088 
Hs.10487 
Hs.95667 
Hs.173135 
Hs.173081 
Hs.283738 
Hs.13313 
Hs.34054 

HsJ224961 

Hs.283675 

Hs.97375 

Hs.19162 

Hs.293024 

Hs.106101 



IMAGE:663856 3' similar to contains Alu repetitive element;, mRNA sequence. 



gb:nz79b1 0.s1 NCLCGAP_GCB1 Homo sapiens cDNA clone 3' similar to gb:M34539 FK506- 

TATA box binding protein (TBP^associated fedor, RNA polymerase I, A, 48kD 
Homo sapiens, done IMAGE:3954132, mRNA, partial cds 
F-box protein 30 

dual-spedficity tyrosine-(Y>phosphorylation regulated kinase 2 

KIAA0530 protein 

casein kinase 1, alpha 1 

cAMP responsive element binding protein-like 2 

Homo sapiens cDNA: FU2248B fis, done HRC10948, highly similar to HSU79298 Human done 

deavage and polyadenylation specific factor 2, 100kD subunit 
NPD009 protein 

Ksapiens gene from PAC 106H8 

ESTs, Weakly similar to. 154374 gene NF2 protein [H.sapiens] 

ESTs 

hypothetical protein FLJ22557 

gb:zr19c09.$1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA done 



113300 RCJ67448 T67448 Hs.13101 

117147 RC_H97225_S AW901347 Hs.38592 

121349 RCJ\A405205 AA405205 Hs.97960 

100294 D49396 AA331881 Hs.75454 

133999 M28213 AA535244 Hs.78305 

133259 AA278548 BE379646 Hs.6904 

129423 AA371418 AA204686 Hs.234149 

131098 RCJ\A459668 U66669 Hs.236642 

135272 AA399391 A1828337 Hs.97591 

129155 AA046865 AI952677 Hs.108972 



ESTs 

hypothetical protein FU23342 

ESTs, Weakly similar to T51 146 ring-box protein 1 [H.sapiens] 

peroxlredoxin 3 ' 

RAB2, member RAS oncogene family 

Homo sapiens mRNA full length Insert cDNA done EUROIMAGE 2004403 
hypothetical protein FU20647 
3^ydroxyisobutyryM)oenzyrne A hydrolase 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434P228 (from done DKFZp434P228) 
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311291 AA056319 
120750 RCJVA310499 
101002 J04058 
133012 AA099241 
103879 AA228148.S 
131281 RC.AA443212 
115109 RCJ\A256383 
118502 RCJI67317 
134100 L07540 
131869 AA484944 
115396 RCJ\A282985 
103860 AA203742 
135089 N75611.S 
129938 U79300 
107508 W90095 
103685 AA005190 
125170 AA203147 
129179 RC.AA504125. 
116262 AA477046 
123009 RCJ\A479949 
131004 D29833 
103317 X83441 
132814 RC_C15251_f 
103992 U77718 
109258 X59710 
110754 RC.N20814 
132727 AA136382_S 
100341 D63506 
134664 AA256106 
103826 AA165564 
111678 RCLR20628 
101341 L76159 
115455 RCJ\A285068 
111192 RCJ\A477748 
129385 RCJ\A235604 
125050 RCJ79951 
122105 RCJVA432278 
121324 RCJ\A404229 
120938 RC_AA386260 
115001 RCJUV251376 
sequence. 

124799 RCJM5088 
sequence. 

122724 RC.AA457395 
117791 RC.N48325 
121895 RC_M427396 



AA782601 Hs.319817 ESTs 

AJ191410 Hs.96693 ESTs, Moderately similar to 210926QA B ceD growth factor [H^apiens] 

AV655843 Hs.169919 etectron-transfer-llavoprotein, alpha polypeptide (glutaric aciduria II) 

AA847843 Hs.6Z711 Homo sapiens, clone IMAGE:3351295, mRNA 

BE543269 Hs.50252 mitochondrial ribosomal protein L32 

AA251716 Hs.25227 ESTs 

AJ249977 Hs.88049 protein kinase, AMP-activated, gamma 3 non-catalytic subunit 

AL157488 Hs.50150 Homo sapiens mRNA; cDNA DKF2p564B182 (from clone DKFZp564B182) 

AA460085 Hs.171075 replication factor C (activator 1) 5 (36.5kD) 

AW968547 Hs.33540 ESTs, Weakly similar to dJ309K20.4 [Rsapiens] 

AA810854 Hs.89081 ESTs 

AW976877 Hs.38057 ESTs 

AI918035 Hs.301198 roundabout (axon guidance receptor, DrosophUa) homolog 1 

AW003668 Hs.135587 Human clone 23629 mRNA sequence 

N74925 Hs.38761 Homo sapiens cDNA: FU21564 fe, done COL06452 

AA158008 Hs.292444 ESTs 

AL020996 Hs.8518 selenoprotein N 

SAW969025 Hs.109154 ESTs 

AI936442 Hs.59838 hypothetical protein FU 10808 

AA535244 Hs.78305 RAB2, member RAS oncogene family 

D29833 Hs.2207 salivary proline-rich protein 

X83441 Hs.166091 ligase IV, DNA, ATP-dependent 

D60730 Hs.57471 ESTs 

BE018142 Hs.300954 Huntingtin interacting protein K 

AL044818 Hs.84928 nuclear transcription factor Y, beta 

AW302200 Hs.6336 KIAA0672 gene product 

N27495 Hs.5565 hypothetical protein FLJ22626 

AF032922 Hs.8813 syntaxin binding protein 3 

AA256106 Hs.87507 ESTs 

AW162998 Hs.24684 KIAA1376 protein 

R384B7 Hs.169927 ESTs 

NML004477 Hs.203772 FSHD region gene 1 

AA876002 Hs.120551 tofl-fike receptor 10 

AW021968 Hs.109438 Homo sapiens clone 24775 mRNA sequence 

AA172106 Hs.110950 Rag C protein 

AW970209 Hs.111805 ESTs 

AW241685 Hs.98699 ESTs 

AA404229 Hs.97842 EST 

AA386260 Hs.104632 EST 

AA251376 gb:zs10a06.s1 NCLCGAPJ3CB1 Homo sapiens cDNA clone IMAGE:684754 3', mRNA 

R45088 gb:yg38g04.s1 Soares infant brain 1NIB Hom9 sapiens cDNA clone IMAGE:34896 3', mRNA 

AA457395 Hs.99457 ESTs 

N48325 Hs.93956 EST 

M427396 gb:zw33a02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:771050 3 1 



similar to contains Alu repetitive eiement;contains MER12.t2 MER12 repetitive element ;, mRNA sequence. 



108244 RC J AA062839 
3\ mRNA sequence. 
117852 RCJ149408 
109298 RCJVA205432 
122432 RC J AA447400 
124627 RC.N74625 



AA062839 gb:zm05c09.s1 Stratagene corneal stroma (937222) Homo sapiens cDNA clone IMAGE:513232 

AW877787 Hs.136102 KIAA0853 protein ■ ' • ~ 

R77854 . Hs.250693 Krueppel-related zinc finger protein 

AA447400 Hs.187684 ESTs, Weakly similarto B34087 hypothetical protein [H.sapiens] 

M74625 gb:za55c03.s1 Soares fetal Over spleen 1NFLS Homo sapiens cDNA done IMAGE:296452 3* 



similar to gb*M14338 VITAMIN K-DEPENDENT PROTEIN S PRECURSOR (HUMAN);contains OFR.b3 OFR repetitive element mRNA sequence. 



115141 RCJ^A258071 

128636 U49065 

115373 RCJVA282197 

114651 RCJVA101400 

132796 RC.M180487 

103749 RCJJ35583 

107328 T83444 

115349 RCLAA281563 

111490 RC.R06862 



AA465131 Hs.64001 Homo sapiens clone 25218 mRNA sequence 

U49065 Hs.102865 interleukin 1 receptpr-like 2 

AA664862 Hs.181022 CGI-07 protein 

AA101400 Hs.189960 ESTs 

NM 006283 Hs.173159 transforming, acidic cofled-coil containing protein 1 

AL135301 Hs.8768 hypothetical protein FU10849 

AW959891 Hs.76591 KIAA0887 protein 

AF121176 Hs.12797 DEAD/H (Asp-Glu-Ala-Asp/Hls) box polypeptide 16 

R06862 gb:yf11e095l Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:126568 3* 



similar to contains L1 repetitive element ;, mRNA sequence. 



103763 AA085354 AA085291 
contains Alu repetitive element mRNA sequence 



118791 RC.N75520 N75520 

116644 RC.F03032 F03032 

116823 RC_H56485 AW204742 
[Ksapiens] 

108940 RCJVU48603 AA148603 
IMAGE:567198 3\ mRNA sequence. 

112218 RC_R50057 R50057 

116557 RC_D20572J D20572 

133649 U25849 U25849 

131745 RC.C20746 AI828559 



Hs.261003 
Hs.290278 
Hs.143542 



Hs.272251 
Hs.90171 
Hs.75393 
Hs.31447 



gb:zn01g06.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA done 3' similar to 

ESTs, Moderately similar to B34087 hypothetical protein [H.saplens] 

ESTs, Weakly similar to B34087 hypothetical protein [H.sapiens] 

ESTs, Highly similarto CSAJWMAN COCKAYNE SYNDROME WD-REPEAT PROTEIN CSA 

gb2o09e04.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens cDNA clone 

Homo sapiens mRNA; cDNA DKFZp586M1418 (from clone DKFZp586M1418) 
EST 

acid phosphatase 1 , soluble 

ESTs, Moderately similar to A46010 X-linked retinopathy protein [Ksapiens] 
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116801 RC.H43879 H43879 
sequence. 

115006 RC.AA251548 AA251548 

123424 RC_AA598500 H29882 

120831 RCJUV347919 AA347919 

103691 AA018298 AA018298 
121555 RC_AA412491 AF025771 
111193 RC_N67946 N67946 
132061 RCJW058946 AB020700 
134575 RC^AA194568_i AA194568 
115050 RC_AA252794 AA252794 
420208 U31799 BE276055 
133735 AC002045jcpt1 R66740 
128546 Z21305 NM.003478 
111946 RC.R40697 R40697 
124879 RC.R73588 R73588 
115683 AA410345 AF255910 

103692 AA018418 AW137912 
(CACNA1F) gene, complete cds; HSP27 
103767 AA089688 BE244667 
125266 W90022 W90022 
PRECURSOR [H.sapiens] 

135235 AA435512 AW298244 

134497 RCJ\A404494 BE258532 

426754 RCJVA278529J NNLQ14264 

412177 RC_AA342828_s Z23091 

132000 RC.AA044644 AW247017 

124738 RCJWJ44644 T07568 

324000 RC.AA195729J AA604749 

106896 RC_AA196729_i AW073202 

132000 RC_AA025858 AW247017 

129577 RC_AA025858 N75346 

107091 RCJW233519 AI949109 

130296 RC_N52271 D31139 

102855 RCJJ68399 NMJ03528 

113689 RCJ\A098874 AB037850 

100939 RC J\A279667_s L04288 

130430 RC H22556 W27893 

106734 RC_N45979_s BE296690 
Intersectin 2 long isoform (ITSN2) mRNA 

135148 RC_AA431288_s AA306478 

134221 RCJ\A609862 BE280456 

105376 RCJI35583 AW994032 

124541 U77718 AF1 12222 

134546 AA203147 AL020996 

134000 RCJW93092 AW175787 

125656 RCJA/93092 AW516428 

100939 RC_N58561_s L04288 

125656 RCJW93092 AW516428 

101779 RCJ/V69385_s BE543412 

332489 RC_R22947 R23053 

133000 RC.N38959J AL042444 

125905 RC_N38959J AI678638 

129000 RC_H73050_s AA744902 

100920 RC_H73050_s X54534 



gb:yo69h09.s1 Soares breast 3NbHBst Homo sapiens cDNA clone (MAGE:18323*3 3", mRNA 

Hs.87886 EST 

Hs.162614 ESTs 

Hs.96889 EST 

Hs.103332 ESTs 

Hs.50123 zinc finger protein 189 

Hs.117569 ESTs 

Hs.3830 KJAA0893 protein 

Hs.85938 EST 

Hs.88009 ESTs 

Hs.95972 silver (mouse homolog) like 

Hs.1 10613 KIAA0220 protein 

Hs.101299 cullin 5 

Hs.76666 C9orf10 protein 

Hs.101533 ESTs 

Hs.54650 junctional adhesion molecule 2 

Hs.227583 Homo sapiens chromosome X map Xp1 1 .23 L-type calcium channel alpha-1 subunit 
pseudogene, complete sequence; and JM1 protein, JM2 protein, and Hb2E genes, complete cds 

Hs.296155 CGM0O protein 

Hs.186809 ESTs. Highly similar to LCT2.HUMAN LEUKOCYTE CELL-DERIVED CHEMOTAXIN 2 

Hs.293507 ESTs 

Hs.251871 CTP synthase 

Hs.172052 serine/threonine kinase 18 

Hs.73734 glycoprotein V (platelet) 

Hs.36978 melanoma antigen, family A, 3 

Hs.137158 ESTs 

HS.19D213 ESTs 

Hs.334825 Homo sapiens cDNA FU14752 fis, clone NT2RP3003071 

Hs.36978 melanoma antigen, family A, 3 

Hs.82906 CDC 20 (cell division cycle 20, S. cerevisiae, homolog) 

Hs.246885 hypothetical protein FU20783 

Hs.154103 LIM protein (similar to rat protein kinase C-Wnding enigma) 

Hs.2178 H2B histone family, member Q 

Hs.16621 DKFZP434I116 protein 

Hs.297939 cathepsinB 

Hs. 150580 putative translation initiation factor 

Hs.288173 Homo sapiens cDNA: FU21747 fis, done COLF5160, highly similar to AF1 821 98 Homo sapiens 

Hs.95327 CD3D antigen, delta polypeptide (TiT3 complex) 

Hs.80248 RNA-binding protein gene with multiple splicing 

Hs.8768 hypothetical protein FU10849 

Hs.44499 pinin, desmosome associated protein 

Hs.8518 selenoprotein N 

Hs.334841 selenium binding protein 1 

Hs.78687 neutral sphingomyelinase (N-SMase) activation associated factor 

Hs.297939 cathepsin B 

Hs.78687 neutral sphingomyelinase (N-SMase) activation associated factor 

Hs.250505 retinoic acid receptor, alpha 

NA Hu01 Chip Redos 

Hs.62402 p21/Cdc42/Rad-activated kinase 1 (yeast Ste20-related) 

Hs.6456 chaperonin containing TCP1 , subunit 2 (beta) 

Hs.1 07767 hypothetical protein PR01489' 

Hs.278994 Rhesus blood group, CcEe antigens 
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TABLE 1A 

Table 1A shows the accession numbers for those pkeys lacking unigenelD's for Tables 1. The pkeys in Table 7 lacking unlgenelD ? s are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
5 using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 

Alignment Tools (DoubleTwIst Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed In the 'Accession" 
column. 



1 0 Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



15 Pkey CAT Number Accession 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



108469 116761J 

124106 125446J 

108501 13684.-12 

108562 36375J 

125008 1802095J 

125020 116017J 

125066 1814993.1 

116661 1532859J 

125104 413347J 

124575 1666S49J 

125263 1547.2 

116845 393481J 

118417 37166J 



118584 532052 J 

103743 112194 J 

103744 114161J 
103746 113452J 
103761 114208 J 
103763 48290.6 
120209 1531817J 
120284 158963J 
112540 1605263J 

.111904 1719336J 

121059 273450J- 

121094 275729.1 

114106 1182096J 

130091 23961_-3 

122264 296527J 

108280 11068ZJ 

129961 1706092J 

130529 158447J 



AA079487 AA128547 AA128291 AA079587 AA079600 

H12245 AA094769 R14576 

AA083256 

AA100796 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA07931 1 AA079323 AA085274 

T91251 T64891 T85665 

T69981 T69924AA078476 

T86284T81933 

R61504 F04247 

T95590AA703278H62764 

N68168N69188N90450 

AA098878 W88942 

AA649530AA659316 H64973 

AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833 
AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 M463992 AW206802 AI970376 AI583718 AI672574 
N25695 AW665466 AI818326 AA126128 AI480345 AW013B27 AA248638 AI214968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW495808 A1080480 AI631703 AI651023 AIB67418 AW818140 AA502500 AI206199 AI671282 AI352545 BE501030 
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 
AA703396 H92276 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE46661 1 AI206344 AA574397 
AA348354 AI493192 

AW136928 AI685655 BE218584 BE465078 N68963 AA975338 BE147199 N76377 
AA075998 AA075999 AA070986 AA070896 AA129207 AA078942 AA070783 AA078941 
AA079267AA076003 
AA075000AA081676 

AA765163 AW298222 AA126126 AA085138 AA076068 

AA085291 AA085354 

F02951Z40892 F04711 

AA179656 AA182626 AA182603 

R69751 R70467 H69771 H80879 H80878 

Z41572 R39330 

AA393283AA398628 

AA402505AA398900 

AW602528BE073859 Z38412 



AA436837AA442594 
AA065069AA085108 
R23053 R79884R76271 
AA178953AA192740 

AA069818 AA069971 AA069923 AA069908 
AA021473 
AA609839 
D45652 
AA128980 
T68875 
T85352 
T85373 



108309 11 1495 J 
107832 genbank^AA021473 
123731 genbank UJ AA609839 
116571 genbanKJ345652' 
132225 gennanK_AA128980 
125017 genbank_T68875 

125063 genbank.J85352 

125064 genbank_T85373 
100964 entrezLJ00212J00212 

125118 149288J R10606T97620AA576309 
102269 entrezJJ30245U30245 
125150 NOT„FOUND.entre^.W38240 W38240 
116801 genbank_H43879 H43879 
118111 genbankJJ55493 N55493 
118129 genbanR_N57493 N57493 
118329 genbanleN63520 N63520 
118475 genbanleN66845 N66845 
111490 genbanleR06862 R06862 
111514 genbanK_R07998 R07998 
104534 R223Q3jat R22303 
120340 genbankJ\A206828 AA206828 
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10 



15 



20 



25 



30 



35 



40 



45 



120376 genbanLAA227469 
104787 genbanKJ\A027317 
120409 genbankJ\A235050 
120745 genbankJV\302809 
120809 genbanK_M346495 
120839 genban!eAA348913 
113702 genbanLT97307 
115001 genbankJ^A251376 
122562 genbanLAA452156 
122635 genbanl^AA454085 
108244 genbankJUV062839 
108277 genbanI^AAC64B59 
122723 genbankJW57380 
124028 genbankJ=04112 
108403 genbankJW)75374 
122860 genbankJM64414 
108427 genbanLAA076382 
108439 genbank taJ AA078986 
131353 231290J 
AA969360 

108533 genbank_M084415 
117031 genbanK_H88353 
124254 genbanLH69899 
101447 entre*_M21305 
101458 entrez M22092 
124577 genbank_N68300 

108940 genbank_AA148603 

108941 genbank_AA148650 
124627 genbanK_N74625 
124720 144582J 

124793 genbank_R44519 
124799 genbankj*45088 
117683 genbanLN40180 
117732 genbanLN46452 
124991 genbanRJ-501 16 
119023 genbanLN98488 
119239 95573.2 



AA227469 

AA0Z7317 

AA235050 

AA302809 

AA346495 

AA348913 

T97307 

AA251376 

AA452156 

AA454085 

AA062839 

AA064859 

AA4573B0 

F04112 

AA075374 

AA464414 

AA076382 

AA078986 

AW411259 H23555 AW015049 A1684275 AW0158B6 AW068953 AW014085 AI027260 R52686 AA918276 A1129462 

N34869 AI948416 AA534205 AA702483 AA705292 

AA084415 

H88353 

M21305 
M22092 



N68300 
AA148603 
AA148650 
N74625 

R05283R11056 

R44519 

R45088 

N40180 

N46452 

T50116 

N98488 

T11483 T11472 



119558 NOT^FOUND.entrez u .W38194 



W38194 



119654 genbankJA/57759 W57759 

105246 genbankj*A226879 AA226879 

121350 genbankJM05237 AA405237 

121558 genbanLAA412497 AA412497 

105985 genbankJ\A406610 AA406610 

100071 entre?J\28102A28102 

114648 genbankJ\A101056 AA101056 

121895 genbanLAA427396 . AA427396 

100327 entre?_D55640D55640 

123315 714071 1 AA496369 AA496646 
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TABLE 2: 



Pkey: Unique Eos probeset Identifier number 

5 Accession: Accession number used for previous patent filings 

ExAccn: Exemplar Accession number, Genbank accession number 
UnigenelD: Unigene number 
Unjgene Title: Unigene gene title 

10 Pkey Accession ExAccn UnigenelD UnigeneTftie 

100420 100420 D86983 Hs. 11 8893 Melanoma associated gene 

100484 100484 NMJJ05402 Hs.288757 v-ral simian leukemia viral oncogene horn 

100991 100991 J03836 Hs.82085 serine (or cysteine) proteinase inhibito 

15 101168 101168 NMJJ053Q8 Hs.211569 G protein-coupled receptor kinase 5 

101261 101261 D30857 Hs.82353 protein C receptor, endothelial (EPCR) 

101447 101447 M21305 gbtfuman alpha satellite and satellite 3 

101543 101543 M31166 Hs.2050 pentaxirwelated gene, rapidly induced b 

101560 101560 AW958272 Hs.347326 intercellular adhesion molecule 2 

20 101714 101714 M68874 Hs.211587 phosphoOpase A2, group IVA (cytosolic, 

101838 101838 BE243845 Hs.75511 connective tissue growth factor 

102012 102012 BE259035 Hs.118400 ■ singed (DrosophilaHike (sea urchin fas 

102164 102164 NM.000107 Hs.77602 damage-specific DNA binding protein 2 (4 

. 102283 102283 AW161552 Hs.83381 guanine nucleotide binding protein 11 

25 102564 102564 U59423 Hs.79067 MAD (mothers against decapentapleglc, Dr 

102759 102759 NMJJ05100 Hs.788 A kinase (PRKA) anchor protein (gravin) 

102804 102804 NM_002318 Hs.83354 lysyl oxidase-like 2 

102898 102898 NMJJ02205 Hs.149609 integrin, alpha .5 (fibronectin receptor, 

103036 103036 M13509 Hs.83169 matrix metattoproteinase 1 (interstitial 

30 103095 103095 NlvjJ)05424 Hs.7B824 tyrosine kinase with immunoglobulin and 

103166 103166 AA159248 Hs.180909 peroxiredoxin 1 

103280 103280 U84722 Hs.76206 cadherin 5, type 2, VE-cadherin (vascula 

103850 103850 AA187101 Hs.213194 hypothetical protein MGC1 0895 

104592 104592 AW630488 Hs.25338 protease, serine, 23 

35 104786 104786 AA027167 Hs.10031 KIAA0955 protein 

104865 104865 T79340 Hs.22575 B-cell CLL/lymphoma 6, member B (zinc fi 

104952 104952 AW076098 Hs.345588 . desmopbkin (DPI, DPII) 

105178 105178 AA313825 Hs.21941 AD036 protein 

105330 105330 AW338625 Hs.22120 ESTs 

40 105729 105729 H46612 Hs.293815 Homo sapiens HSPC285 mRNA, partial cds 

105977 105977 AK001972 Hs.30822 hypothetical protein RJ1 11 10 

106031 106031 X64116 Hs.171844 Homo sapiens cDNA: FU22296 fis, clone H 

106155 106155 AA425414 Hs.33287 nuclear factor l/B 

106423 106423 AB020722 Hs.16714 Rho guanine exchange factor (GEF) 15 

45 107174 107174 BE122762 Hs.25338 ESTs 

107295 107295 AA186629 Hs.80120 UDP44^cetyWalpha-D-galactosamine:polyp 

108756 108756 AA127221 Hs.117037 ESTs 

108888 108888 AA135606 Hs.189384 gb:z!10a05.s1 Soare3_pregnanLuterus_NbH 

109166 109166 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 

50 -109768 109768 F0683B ' . Hs.14763 ESTs 

110906 110906 AA035211 Hs.17404 ESTs 

111006 111006 BE387014 Hs.166146 Homer, neuronal immediate early gene, 3 

111133 111133 AW580939 Hs.97199 complement component C1q receptor 

113073 113073 N39342 Hs.103042 microtubule-associated protein 1B 

55 113923 113923 AW953484 Hs.3849 hypothetical protein FU22041 similar to 

115061 115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDN 

115145 115145 AA740907 Hs.88297 . .ESTs 

115947 115947 R47479 Hs.94761 . KIAA1691 protein 

116339 116339 AK000290 Hs.44033 dipeptidyl peptidase 8 

60 116589 116589 A1557212 Hs.17132 ESTs, Moderately similar to I54374 gene 

117023 117023 AW070211 Hs.102415 Homo sapiens mRNA; cDNA DKFZp586N0121 (f 

117563 117563 AF055634 Hs.44553 unc5 (C.eiegans homolog) c 4 

118475 118475 N66845 gb:za46c11.s1 Soares fetal liver spleen 

119073 119073 BE245360 Hs.279477 ESTs 

65 119174 119174 R71234 gb:yi54c08.s1 Soares placenta Nb2HP Homo 

119416 119416 T97186 gb7e50h09^1 Soares fetal liver spleen 

121335 121335 AA404418 gb:zw37e02.s1 ScaresJotaLfetus_Nb2HF8_ 

123160 123160 AA488687 Hs.284235 ESTs, Weakly similar to I38022 hypotheti 

123523 123523 AA608588 gbae54e06^1 Stratagene lung carcinoma 

70 123964 123964 C13961 gb:C13981 Clontech human aorta polyA+mR 

124315 124315 NIVLO05402 Hs288757 v-ral simian leukemia viral oncogene horn 

124669 124669 AI571594 Hs.102943 hypothetical protein MGC1291 6 

124875 124875 AI887654 Hs.285814 sprouly (Drosophila) homolog 4 

125103 125103 AA570056 Hs.122730 ESTs, Moderately similar to KIAA1215 pro 

75 125565 125565 R20840 gb:yg05c08.r1 Soares Infant brain 1NIB H 
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126511 126511 
126649 126649 
449602 449602 
127402 127402 
128992 128992 
129188 129188 
129371 129371 
129765 129765 
129884 129884 
130639 130639 
130828 130828 
131080 131080 
131182 131182 
131573 131573 
131756 131756 
131881 131881 
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Hs.166846 

Hs.99936 

Hs.156346 

Hs.89640 

Hs.78995 

Hs.902 

Hs.9884 

Hs.3069 

Hs.102267 

Hs.295944 

Hs.151254 

Hs.57435 

Hs.154879 

Hs.1227 

Hs.76152 

Hs.1846 

Hs.758 

Hs.74502 

Hs.79194 

Hs.151531 

Hs.2050 

Hs.247930 

Hs.75133 

Hs.247916 

Hs.80612 

Hs.77813 

Hs.77813 

Hs.77813 

Hs.78991 

Hs.72885 

Hs.108043 

Hs.89890 

Hs.8262 

Hs.184062 

Hs.248153 

Hs.75307 

Hs.79432 

Hs,99872 

Hs.151051 

Hs.96264 

Hs.1594 

Hs.1 55596 

Hs.1592 . 

Hs.180069 

Hs.157205 

Hs.32964 

Hs.25590 

Hs.44566 

Hs.82240 

Hs.54483 

Hs.192023 

Hs.1 18825 

Hs.2442 

Hs.1 12255 

Hs.41706 

Hs.75862 



DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
TRAM-like protein 
K1AA0071 protein 

postmeiotic segregation increased 2-like 
KIAA01 18 protein 

gb:Human monocyte PABL (pseudoautosomal 
KIAA0143 protein 

TIA1 cytotoxic granule-associated RNA-bi 
KIAA0175 gene product 
CD38 antigen (p45) 
KIAA0225 protein 

solute carrier family 23 (nucleobase tra 
topoisomerase (DNA) II binding protein 
cystatin D 

Homo sapiens, Similar to hypothetical pr 
SRY (sex determining region Y)-box 2 
neuregullnl 

polymerase (DNA directed), epsilon 
keratin 10 (epidermolytlc hyperkeratosis 
topoisomerase (DNA) II alpha (170kD) 
TEK tyrosine kinase, endothelial (venous 
MADS box transcription enhancer factor 2 
neurofibromin 2 (bilateral acoustic neur 
spindle pole body protein 
heat shock 70kD protein 9B (mortalin-2) 
lysyl oxidase 

tissue factor pathway inhibitor 2 
kallikrein 7 (chymotryptic, stratum com 
solute carrier family 11 (proton-coupled 
transmembrane trafficking protein 
DiGeorge syndrome critical region gene D 
aminotevuBnate, delta-, dehydratase, 
decorin 

gb:Human alpha satellite and satellite 3 
tumor protein p53 (U-Fraumeni syndrome) 
RAS p21 protein activator (GTPase activa 
chymotrypsinogen B1 
cAMP responsive element binding protein 
protein phosphatase 3 (formerly 2B), cat 
nidogen (enactin) 

pentaxin-related gene, rapidly induced b 
Epsilon , IgE 

transcription factor 6-Iike 1 (mitochond 
natriuretic peptide precursor C 
ubiqultin-conjugating enzyme E2A (RAD6 h 
sphingomyelin phosphodiesterase 1 , acid 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
DNA segment numerous copies, expressed 
azurocidin 1 (cationic antimicrobial pro 
Friend leukemia virus integration 1 
pyruvate carboxylase 
lysosomal 

putative RabS-interacting protein 
olfactory marker protein 
tomplex^ssodated-testis-expressed 1- 
fibrillin 2 (congenital contractural ara 



mitogennactivatec; protein kinase 10 
alpha thalassemia/menta! retardation syn 
centromere protein A (17kD) 
BCL2/adenovirus E1B 19kD-interacting pro 
CDC16 (cell division cycle 16, S. cerevl 
nuclear respiratory factor 1 
branched chain aminotransferase 1, cytos 
SRY (sex determining region Y)-box 1 1 
stanniocalcin 1 
K1AA1641 protein 
syntaxin 3A 

N-myc (and STAT) Interactor 
eukaryotic translation initiation factor 
mltogen-actlvated protein kinase kinase 
a disintegrin and metalloproteinase doma 
nudeoporin98kD 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
MAD (mothers against decapentapiegic, Dr 
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132828 132828 AB014615 Hs.57710 

132828 132828 AB014615 Hs.57710 

130441 130441 U63630 Hs.155637 

129350 129350 U50535 Hs.110630 

5 130457 130457 AB014595 Hs.155976 

102560 102560 R97457 Hs.63984 

134305 134305 U61397 Hs.81424 

132736 132736 AW081883 Hs.211578 

102663 102663 NWL002270 Hs.168075 

10 102735 102735 AF111106 Hs.3382 

101175 101175 U82671 Hs.36980 

132164 132164 A1752235 Hs.41270 

102826 102826 NMJJ07274 Hs.8679 

102846 102846 BE264974 Hs.6566 

15 134161 134161 AA634543 Hs.79440 

302383 302363 AW163799 Hs.198365 

125701 125701 T72104 Hs.93194 

134656 134656 AI750878 Hs.87409 

102968 102968 AU076611 Hs.154672 

20 134037 134037 AI808780 Hs.227730 

103023 103023 AW500470 Hs.1 17950 

130282 130282 BE245380 Hs.153952 

128568 128568 H12912 Hs.274691 

103093 103093 S79876 Hs.44926 

25 129063 129063 X63094 Hs.283822 

133227 133227 AW977263 Hs.68257 

103184 103184 U43143 Hs.74049 

103208 103208 AW411340 Hs.31314 

131486 131486 F06972 Hs.27372 

30 103334 103334 NWL001260 Hs.25283 

135094 135094 NMJ)03304 Hs.250687 

103352 103352 H09366 HsJ8853 

132173 132173 X89426 Hs.41716 

131584 131584 AA598509 Hs.29117 

35 103378 103378 AL119690 Hs.153618 

103410 103410 AA158294 Hs.295362 

103438 103438 AW175781 Hs.152720 

103452 103452 NM.006936 Hs.85119 

135185 135185 AW404908 Hs.96038 

40 134662 134662 NNL007048 Hs.284283 

103500 103500 AW408009 Hs.22580 

132084 132084 NM_002267 Hs.3886 

133152 133152 211695 Hs.324473 

103612 103612 BE336654 Hs.70937 

45 103692 103692 AW137912 Hs.227583 

129796 129796 BE218319 Hs.5807 

132683 132683 BE264633 Hs.143638 

103723 103723 BE274312 Hs.214783 

133260 133260 AA403045 Hs.6906 

50 103766 103766 AI920783 Hs.191435 

132051 132051 AA393968 Hs.180145 

135289 135289 AW372569 Hs.9788 

103794 103794 AF244135 Hs.30670 

134319 134319 BE304999 Hs.285754 

55 119159 119159 AF142419 Hs.15020 

103850 103850 AA187101 Hs.213194 

322026 322026 . AW024973 Hs.283675 

103861 103861 AA206236 Hs.4944 

447735 447735 AA775268 Hs.6127 

60 131236 131236 AF043117 Hs.24594 

129013 129013 AA371156 Hs.107942 

103988 103988 AA314389 Hs.342849 

425284 425284 AF155568 Hs.348043 

133281 133281 AK001601 Hs.69594 

65 108154 108154 NNL005754 Hs.220689 

135073 135073 W55956 Hs.94030 

129593 129593 AI338247 Hs.98314 

132064 132064 AA121098 Hs.3838 

131427 131427 AF151879 Hs.26706 

70 104282 104282 C14448 Hs.332338 

130443 130443 D25216 Hs.155650 

132837 132837 AA370362 Hs.57958 

104334 104334 D82614 Hs.78771 

134731 134731 D89377 Hs.89404 

75 131670 131670 H03514 Hs.15589 

104402 104402 H56731 Hs.132956 



fibroblast growth factor 8 (androgen-tnd 
fibroblast growth factor 8 (androgen-ind 
protein kinase, DNA-activated, catalytic 
Human BRCA2 region, mRNA sequence CG006 
cuiBn 4B 

cadherin 13, H-cadherin (heart) 
ubiquitin-like 1 (sentrin) 
Homo sapiens cDNA: FU23037 fis, clone L 
karyopherin (importin) beta 2 
protein phosphatase 4, regulatory subuni 
melanoma antigen, family A, 2 
procollagen-lysine, 2-oxoghitarate 5-dio 
cytosoBc acyt coenzyme A thioester hydr 
thyroid hormone receptor interactor 13 
IGF-II mRNA-binding protein 3 
2,3-bisphosphoglycerate mutase 
" apoiipoprotelnA-i 
thrombospondln 1 

methylene tetrahydrofoiate dehydrogenase 
integrin, alpha 6 

- multifunctional polypeptide similar to S 
5* nucleotidase (CD73) 
adenylate kinase 3 

dipeptidylpeptidase IV (CD26, adenosine 
Rhesus blood group, D antigen 
general transcription factor IIF, polype 
fms-related tyrosine kinase 4 
retinoblastoma-binding protein 7 
BMX non-receptor tyrosine kinase 
cyciin-dependent kinase 8 
transient receptor potential channel 1 
uracil-DNA glycosyiase 
endothelial cell-specific molecule 1 
purine-rich element binding protein A 
HCGVIIM protein 

DR1 -associated protein 1 (negative cofac 
M-phase phosphoproteln 6 
SMT3 (suppressor of mif two 3, yeast) ho 
Ric (DrosophilaHike, expressed in many 
butyrophflin, subfamily 3, member A1 
atkylglycerone phosphate synthase 
karyopherin alpha 3 (importin alpha 4) 
mitogen-activated protein kinase 1 
. H3 histone family, member A 
Homo sapiens chromosome X map Xp1 1.23 L- 
GTPase Rab14 
WD repeat domain 4 

Homo sapiens cDNA FU14041 fts, clone HE 
Homo sapiens cDNA: FU23197 fis, clone R 
ESTs 

HSPC030 protein 

hypothetical protein MGC10924 similar to 
hepatocellular carcinoma-associated anti 
fumarate hydratase 

homolog of mouse quaking OKI (KH domain 

hypothetical protein MGC10895 

NPD009 protein . 

hypothetical protein FU12783 

Homo sapiens cDNA: FU23020 fis, clone L 

ublquitinatlon factor E4B (homologous to 

DKFZP564M1 12 protein 

ADP-ribosylation factor-like 5 

NS1 -associated protein 1 

high-mobility group 20A 

Ras-GTPase-activating protein SH3-domain 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

Homo sapiens mRNA; cDNA DKFZp586L0120 (f 

serum-inducible kinase 

CGM21 protein 

EST 

KIAA0014 gene product 

EGF-TOTJatrophllirwelated protein 

phosphogtycerate kinase 1 

msh (DrosophOa) homeo box homolog 2 

ESTs 

ESTs 
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129077 129077 
134927 134927 
134498 134498 
104488 104488 
5 129214 129214 
104530 104530 
104544 104544 
104567 104567 
129575 129575 
10 104599 104599 
104667 104667 
104764 104764 
104787 104787 
104804 104804 
15 130828 130828 
.104943 104943 
105024 105024 
105038 105038 
105096 105096 
20 105169 105169 
130401 130401 
130114 130114 
105337 105337 
105376 105376 
25 131962 131962 
128658 128658 
105508 105508 
135172 135172 
132542 132542 
30 105659 105659 
105674 105674 
105722 105722 
115951 115951 
105985 105985 
35 131216 131216 
113689 113689 
130839 130839 
130777 130777 
106196 106196 
40 133200 133200 
106328 106328 
' 106423 106423 
439608 439608 
106503 106503 
45 106543 106543 
106589 106589 
106596 106596 
106636 106636 
131353 131353 
50 131710 131710 
131775 131775" 
106773 106773 
106817 106817 
106848 106848 
55 418699 418699 
130538 130638 
107059 107059 
107115 107115 
107156 107156 
60 130621 130621 
132626 132626 
131610 131610 
107295 107295 
107315 107315 
65 107328 107328 
134715 134715 
129938 129938 
130074 130074 
132036 132036 
70 113857 113857 
130419 130419 
132616 132616 
132356 132358 
125827 125827 
75 107609 107609 
107714 107714 



N74724 


Hs.108479 


L36531 


Hs.91296 


AW246273 


Hs.84131 


N56191 


Hs.106511 


AL044335 


Hs.109526 


AK001676 


Hs.12457 


AJ091173 


Hs.222362 


AA040620 


Hs.5672 


F08282 


Hs.278428 


AW815036 


Hs.151251 


AI239923 


Hs.63931 


A1039243 


Hs.278585 


AA027317 




AI858702 


Hs.31803 


AW631469 


Hs.203213 


AF072873 


Hs.114218 


AA126311 


Hs.9879 


AW503733 


Hs.9414 


AL042508 


HS21599 


BE245294 


Hs.1 80789 


BE396283 


Hs.1 73987 


AA233393 


Hs.14992 


AI468789 


Hs.347187 


AW994032 


Hs.8768 


AK000046 


Hs.343877 


BE397354 


Hs.324830 


AA173942 


Hs.326416 


AB028956 


Hs.12144 


AL137751 


Hs.263671 


AA283044 


Hs.25625 


AJ609530 


Hs.279789 


AI922821 


Hs.32433 


BE546245 


Hs.301048 


AA406610 




AI815486 


Hs.243901 


AB037850 


Hs.16621 


AB011169 


Hs.20141 


AW1 35049 


Hs.26285 


AA525993 


Hs.173699 


AB037715 


Hs.183639 


AL079559 


Hs.28020 


AB020722 


Hs.16714 


AW864696 


Hs.301732 


AB033042 


Hs.29679 


AA676939 


Hs.69285 


AK000933 

ni Www www 


Hs.28661 


AA452379 




AWQ58037 


Hs.286 


AW754182 




mm 015368 


Hs 30985 


AB014548 

now i t*^tu 


Hs 31921 


AA478109 


Hs.188833 


D61216 


Hs.18672 


AA449014 


Hs.1 21025 


BE539639 


Hs.173030 


AW021276 


Hs.17121 


BE614410 

UUV 1 Tl ■ W 


Hs.23044 


BE379623 


Hs.27693 


AA1 37043 

rU» 1 Wr VTw 


Hs.9663 


AW513087 


Hs.1 6803 


AW504732 


Hs.21275 


AA357879 


Hs.29423 


AA1 86629 


Hs.80120 


AA316241 


Hs.90691 


AW959891 


Hs.76591 


U48263 


Hs.89040 


AW003668 


Hs.1 35587 


AL038596 


Hs250745 


AL157433 


Hs.37706 


AW243158 


Hs.5297 


AF037448 


Hs.155489 


BE262677 


Hs.283558 


NMJJ03542 


Hs.46423 


NNL003403 


Hs.97496 


R75654 


Hs.164797 


AA015761 


Hs.60642 



ESTs 

Integral, alpha 8 

threonyl-tRNA synthetase 

protocadherin 17 

zinc finger protein 198 

hypothetical protein RJ10814 

ESTs, Weakly similar to p40 [H.sapiens] 

hypothetical protein AF140225 

progestin induced protein 

ESTs 

ESTs 

ESTs 

gb:ze97d11.s1 Soares_fetaLhearLNbHH19W 
ESTs, Weakly similar to N-WASP [H.sapien 
ESTs 

frizzled (Drosophaa) homoiog 6 
ESTs 

KIAA1488 protein 
Kruppel-Gke factor 7 (ubiquitous) 
S164 protein 

eukaryotic translation initiation factor 
hypothetical protein FU11 151 
myotubularin related protein 1 
hypothetical protein FU10849 
hypothetical protein FU20039 
diptheria toxin resistance protein requi 
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 
KIAA1033 protein 

Homo sapiens mRNA; cDNA DKFZp434l0812 (f 
hypothetical protein FU11323 
histone deacetytase 3 
ESTs 

sec13-iike protein 

gb:zv15b10.s1 Soares_NhHMPu_S1 Homosapi 

Homo sapiens cDNA RJ20738 fis, clone HE 

DKF2P434I116 protein 

slmflartb S. cerevisiae SSM4 

Homo sapiens cDNA RJ10643 fis, clone NT 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

hypothetical protein FU10210 

K1AA0766 gene product 

Rho guanine exchange factor (GEF) 15 

hypothetical protein MGC5306 

cofector required for Sp1 transcriptiona 

neuropiOnl 

Homo sapiens cDNA FU10Q71 fis, clone HE 
ESTs, Moderately similar to ALU7JHUMAN A 
ribosomal protein L4 

gb:RC2-CT0321-131199-011-c01 CT0321 Homo 

pannexin 1 

KIAA0648 protein 

ESTs 

ESTs 

chromosome 11 open reading frame 5 
ESTs, Weakly similar to ALU8JHUMAN ALU S 
ESTs 

RAD51 (S. cerevisiae) homoiog (E coli Re 
peptldylprolyl isomerase (cyc!ophilin)-l 
programmed cell death 6-interacting prot 
LUC7 (S.cerevisiaeHike 
hypothetical protein FU11011 
scavenger receptor with C-type lectin 
UDP-N-acetyKalpha-D-gabctosamine:polyp 
nucleophosmln/nucleoplasmin 3 
KIAA0887 protein 
prepronociceptin 

Human clone 23629 mRNA sequence 
polymerase (RNA) 111 (DNA directed) {62k 
hypothetical protein DKFZp434E2220 
DKFZP564A2416 protein 
NS1-associated protein 1 
hypothetical protein PR01855 
H4 histone family, member G 
YY1 transcription factor 
hypothetical protein FLI13693 
ESTs 
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10 



15 



20 



25 



30 



35 



40 



107832 


107832 


AA021473 




124337 


124337 


N23541 


Hs.281561 


129577 


129577 


N75346 


Hs.306121 


132000 


132000 


AW247017 


Hs.36978 


107935 


107935 


AA029428 


Hs.61555 


131461 


131461 


AA992841 


Hs.27263 


108029 


108029 


AA040740 


Hs.62007 


108084 108084 


AA058944 


Hs.116602 


108168 


108168 


AI453137 


Hs.63176 


108189 


108189 


AW376061 


Hs.63335 


108203 


108203 


AW847814 


Hs.289005 


108217 108217 


AA05B686 


Hs.62588 


108277 108277 


AA064859 




108309 108309 


AA069818 




108340 108340 


AA069820 


Hs.180909 


108427 108427 


AA076382 




108439 


108439 


AA078986 




108469 


108469 


AA079487 




108501 


108501 


AA083256 . 




108562 


108562 


AA100796 




130890 


130890 


AI907537 


Hs.76698 


130385 


130385 


AW067800 


Hs.155223 


108807 


108807 


AI652236 


Hs.49376 


108833 


108833 


AF188527 


Hs.61661 


108846 


108846 


AL1 17452 


Hs.44155 


131474 


131474 ' 


L46353 


Hs.2726 


108941 


108941 


AA148650 




108996 


108996 


AW995610 


Hs.332436 


131183 


131183 


AI611807 


Hs.285107 


109022 


109022 


AA157291 


Hs.21479 


109068 


109068 


AA164293 


Hs.72545 


129021 


129021 


AL044675 


Hs.173081 


109146 


109146 


AA176589 


Hs.142078 


131080 


131080 


NMJJ01955 


Hs.2271 


109222 


109222 


AA192833 


Hs.333512 


109481 


109481 


AA878923 


Hs.289069 


109516 109516 


AI471639 


Hs.71913 


109556 109556 


AI925294 


Hs.87385 


109578 


109578 


F02208 


Hs.27214 


109625 


109625 


H29490 


Hs.22697 


109648 


109648 


H17800 


Hs.7154 


109699 


109699 


H18013 


Hs.167483 


109933 


109933 


R52417 


Hs.20945 


110039 


110039 


H11938 


Hs.21907 



gbze66c1 1 .si Soares retina N2MHR Homo 
Homo sapiens cDNA: FU23582 fis, done L 
CDC20 (cell division cycle 20, S. cerevi 
melanoma antigen, family A, 3 
ESTs 

K1AA1458 protein 
ESTs 

Homo sapiens, clone iMAGE:41 54008, mRMA, 
ESTs 

ESTs, Moderately similar to A46010 X-lin 
Homo sapiens cDNA: FU21532 fis, done C 
ESTs 

gb:zm50f03.s1 Stratagene fibroblast (937 
gb:zm67e03.r1 Stratagene neuroepftheGum 
peroxiredoxin 1 

gb:zm91g08.s1 Stratagene ovarian cancer 

gb:zm92h01.s1 Stratagene ovarian cancer 

gbzm97fl)8.s1 Stratagene colon HT29 (937 

gbznQ8g12.s1 Stratagene hNT neuron (937 

gbzm26c06.s1 Stratagene pancreas (93720 

stress-associated endoplasmic reticulum 

stanniocaldn 2 

hypothetical protein FLJ20644 

ESTs, Weakly similar to AF1 74605 1 F-box 

DKFZP586G1517protBin 

high-mobmty group (nonhistone chromoso 

gb:zo09e06.s1 Stratagene nBuroepitheiium 

EST 

hypothetical protein FLJ13397 

ubinuclein 1 

ESTs 

KIAA0530 protein 
EST 

endothelinl 

similar to rat myomegarm 

hypothetical protein FU21016 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens done 24993 mRNA sequence 
histone acetyltransferase 
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TABLE 2A 

Table 2A shows the accession numbers for those pkeys lacking unlgenelD's for Table 2. The pkeys In Table 7 lacking unfgeneiD's are represented within 
Tables 1 -6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 'Accession' 
column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accession 



108469 
108501 
108562 
101300 



125565 
132983 



121335 
130018 
121822 



108309 
107832 
123523 
123964 
118475 
104787 
106596 
113947 
108277 



116761J 
13684.-12 
36375.1 
4669J 



117156 145392.1 



1704098.1 
11922.1 



133681 13893J 



AA079467 AA128547 AA128291 AA079587.AA079600 
AAQ83256 

AA100796 AF020589 AA074629 AA075946 AA100849 M085347 AA126309 AA07931 1 AA079323 AA085274 
BE535511 M62098 AA306787 AWB91766AA348998AA338869 AA344013 AW956561 AW389343 AW403607 L40391 
AW408435 AA121738 AI568978 H13317 R20373 AW948724 AW948744 AA335023 AA436722 AA448690 C21404 
AW884390 AA345454 AA303292 AA174174 BE092290 T90614 M035104 R76028 AA126924 AA741086 AW022056 
AW1 18940 AA121666 AI832409 AA683475 AI140901 AI623576 AW519064 AW474125 AI953923 AI735349 AW150109 
AI436154 AW118130 AW270782 A1804073 N27434 AA876543 AA937B15 AI051166 AA505378 AI041975 AI335355 
AI089540 AA662243 AI127912 AI925604 AI250880 AI366874 AI564386 AI815196 AI683526 AI435885 AI160934 H79030 
AI801493 AA448691 AI673767 AI076042 AI804327 AA813438 AA680002 AI274492 T16177 AI287337 AI935050 
AA907805 AA91 1493 AI58941 1 AI371358 AW576236 AI078866 AW5161 68 AA346372 AI5601 85 AA471009 R75857 
AA296025 AA523155 AA853168 AI696593 AI658482 AI566601 AW072797 AA128047 AA035502 AW243274 AA992517 
R43760 

W73853 AA9281 12 W77887 AW889237 AA148524 AI749182 AI754442 AI338392 AI253102 A1079403 AI370541 AI697341 
H97538 AW188021 AI927669 W72716 AI051402 AI188071 A1335900 N21488 AW770478 W92522 AI691028 AI913512 
AI144448 W73819 AA604358 N28900 W95221 AI868132 H98465 AA148793 
R20840R20839 

M30269 NM 002508 X82245 AI078760 AW957003 D78945 M27445 AA650439 AL048816 AV660256 AV660347 
AA333052 BE295257 T60999 AA383049 AW369677 Z26985 AW175704 AA343326 AW747957 A1818389 W17308 
W17302 H15591 AA371284 AA370412 W94966 BE384365 T28498 R80714 R16959 H21723 AW835154 D56097 D56381 
W21232 AA190565 AW379755 AW067895 

AI352558 Z82248 X78138 NM.003405 AU077248 AA223125 S80794 D78577 AU24697 AW403970 BE614089 BE296713 
BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA084159 BE538808 AW401377 AA256774 C03366 W46595 W47608 AA305009 H69431 H69456 AL120082 
H1 1706 AA303717 AA361357 H22042 H78020 AW999584 AA1 34368 AA32291 1 AA322961 H60980 N85248 N31547 
H79624 T1 1718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA608688 H82692 BE155392 AW383636 BE155394 AA487004 AW383504 AI342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25364 AA367935 AI559406 AA033522 AA374252 
AW835019 A1922133 A1697089 N99662 AW189078 AI199076 AW151598 W59944 AA662875 W94022 AA299055 
AI039008 AI829449 AA583503 A1635674 AW131665 A1473820 AW2731 18 AW900930 AA908944 AI688035 AW170272 
AI082545 AW468176 AI608761 AI082748 AI91 1 682 AI248943 AI831 016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905196 AI191245 AI470204 AI188296 AI421367 AI125315 AI087141 AA629032 AA740589AI554181 
AA150830 AI248541 AI077943 AA775958 AA864930 AI261476 AI123121 AI310394 AA862331 AA872478 BE537084 
A1205606 AA720684 AI872093 AW150042 AL120538 AA219627 AA988608 C21397 AI359337 H25337 AI089749 
AA605146 AI359620 AA150478 AI359738 AW383642 AW995424 AI766457 R56892 AI089839 W61343 N69107 W46459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA126874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 L08439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520 AW028427 AA171496 AI469689 AW664539 AI81 1 102 AI81 1 1 16 BE464590 BE350791 H78021 T15405 H21979 
AA219489 H13301 AA505883 AI864305 AI423963 AW084401 F04963 R69858 H67097 AI917740 AI655561 H69864 
AA033631 AW383484 AI886261 H25293 AA513281 AW271 187 H1 1617 N79982 AI174338 AI904207 AI904208 BE614558 
W94127 W65436 AI272249 AA700018 AI579932 AI085941 AW152629 
AA404418AI217248 

AA353093 AW957317 AW872498 A1560785 A1289110 AW135512 X97261 T68873 

AI743860 N49543 AW027759 BE349467 AI656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI611424 
AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AI61 161 1 AI611407 AI61 1452 W56437 AI284566 
AI583349 AW183058 AI308085 AI074952 AA437315 AA628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AI432414 AI984994 AI539335 AA401550 AA358757 A1609976 M442357 AA359393 AA437046 
AA370301 AA429328 AW272055 A1580502 AIB32944 AI038530 AA425107 AI014986 AI148349 AW237721 AW779756 
AW137877 AJ125293AA400404 R28554 
AA069818 AA069971 AA069923 AA069908 
genbank_AA021473 AA021473 
genbankJ\A608588 AA608588 
genbank_C13961 C13961 
genbank_N66845 N66B45 
genbanLAA027317 AA027317 

304084 1 AI583948 AA57821 2 AW303715 AA653450 AA456981 AI400385 W88533 A12241 33 AW272145 AA088886 R94698 

genbaniLw84768W84768 

genbankJ\A064859 AA064859 



279548.1 

18986J 

244391.1 



111495.1 
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108427 genbank^AA076382 AA076382 

108439 genbankJW)78986 AA078986 

131353 231 290 1 AW41 1259 H23555 AW015049 AI684275 AW015886 AW06B953 AW014085 A1027260 R52686 AA918278 AI129462 

AA969360 N34869 AI948416 AA534205 AA702483 AA705292 

101447 entre^M21305 M21305 

108931 genbanU\A147186 AA147188 

108941 genbankJ\A148650 AA148650 

103138 entre?_X65965 X65965 

119174 genbank_R71234 R71234 

119416 genbanU"97186 T97186 

105985 genbanLAA406610 AA406610 

100327 entre?_D55640 D55640 
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TABLE 3: 
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55 
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65 
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Pkey: Unique Eos probeset Identifier number 

Accession: Accession number used for previous patent fiOngs 

ExAccn: Exemplar Accession number, Genbank accession number 

UnfcenelD: Unigene number 

Unlgene Title: Unigene gene title 



Pkey Accession ExAccn UniGene UnigeneTitle 



100405 D86425 
100420 D86983 
100481 HG1098-HT1098 
100484 HG1103-HT1103 
100718 HG3342-HT3519 
100991 J03764 
101097 L05797 
101168 L15388 
101194 L20971 
101261 L35545 
101345 L76380 
101447 M21305 
101485 M24736 
101543 M31166 
101550 M31551 
101560 M32334 
101674 M61916 
101714 M68B74 
101741 M74719 
101838 M92934 
101657 M94856 
102012 U03057 
102024 U03877 
102164 U18300 
102241 U27109 
102283 U31384 
102303 U33053 
102564 U59423 
102663 U70322 
102759 U81607 - 
102778 U83463 
102804 U89942 
102887 X04729 
102898 X06256 
102915 X07820 

103036 X54925 

103037 X54936 
103095 X60957 
103158 X67235 
103166 X67951 
103185 X69910 
103280 X79981 
103554 Z18951 
103850 AA187101 
104465 N24990 
104592 R81003 
104764 AA025351 
104786 AA027168 
104850. AA040465 
104865 AA045136 
104894 AA054087 
104952 AA071089 
104974 AA085918 
105178 AA187490 
105263 AA227926 
105330 AA234743 
105376 AA236559 
105729 AA292694 
105826 AA398243 
105977 AA406363 
106008 AA411465 
108031 AA412284 
106124 AA423987 



AW291587 Hs.82733 
D86983 Hs.1 18893 
X70377 Hs.121489 
NM_005402Hs.288757 
BE295928 Hs.75424 
J03836 Hs.82085 
BE245301 Hs.89414 
NM_005308Hs.211569 
L20971 Hs.188 
D30857 Hs.82353 
NM_005795Hs.152175 
M21305 - 
AA296520 Hs.89546 
M31166 Hs.2050 
Y00630 Hs.75716 
AW958272 Hs.347326 
NWL002291HS.82124 
M68874 Hs.211587 
NMJ03199Hs.326198 
BE243845 Hs.75511 
BE550723 Hs.153179 
BE259035 Hs.118400 
AA301867 Hs.76224 
NWL000107HS.77602 
NM_007351Hs.268107 
AW161552 Hs.83381 
U33053 Hs.2499 
U59423 Hs.79067 
NWL002270HS.168075 
NM_005100Hs.788 
AF000652 Hs.8180 
NIVL002318HS.83354 
J03836 Hs.82085 
NNL002205HS.149609 
X07820 Hs.2258 
M13509 Hs.83169 
BE018302 Hs.2894 
NIVL005424HS.78824 
BE2425B7 Hs.1 18651 
AA159248 Hs.180909 
NMJQ6825Hs.74368 
U84722 Hs.76206 
AI878826 Hs.74034 
AA187101 Hs.213194 
Z44203 Hs.26418 
AW630488 Hs.25338 
AI039243 Hs.278585 
AA027167 Hs.10031 
AL133035 Hs.8728 
T79340 Hs.22575 
AF065214 Hs.18858 
AW076098 Hs.345588 
Y12059 Hs.278675 
AA313825 Hs.21941 
AW388633 Hs.6682 
AW338625 Hs.22120 
AW994032 Hs.8768 
H46612 Hs.293815 
AA478756 Hs.194477 
AK001972 Hs.30822 
AB033888 Hs.8619 
X64116 Hs.171844 
H93366 Hs.7567 



Melanoma i 
cystatinD 

v-ral simian leukemia viral oncogene horn 
inhibitor of DNA binding 1 , dominant neg 
serine (or cysteine) proteinase inhibito 
chemokine (C-X-C motiO, receptor 4 (fus 
G protein-coupled receptor kinase 5 
phosphodiesterase 4B, cAMP-specific (dun 
protein C receptor, endothelial (EPCR) 
calcitonin receptor-like 
gb:Human alpha satellite and satellite 3 
selectin E (endothelial adhesion molecul 
pentaxin-related gene, rapidly induced b 
serine (or cysteine) proteinase tnhibito 
Intercellular adhesion molecule 2 
laminin, beta 1 

phospholipase A2, group IVA (cytosolic, 
transcription factor 4 
connective tissue growth factor 
fatty acid binding protein 5 (psoriasis- 
singed (Drosopnila)-(ike (sea urchin fas 
EGF-containing fibulin-like extracellula 
damage-specific DNA binding protein 2 (4 
muliimerin 

guanine nucleotide binding protein 11 

protein kinase (Mike 1 

MAD (mothers against decapentaplegic, Dr 

karyopherin (importin) beta 2 

A kinase (PRKA) anchor protein (gravin) 

syndecan binding protein (syntenin) 

iysyl oxidase-iike 2 

serine (or cysteine) proteinase inhibfto 
integral, alpha 5 (fibronectin receptor, 
matrix metaOoproteinase 10 (stromelysin 
matrix metaHoproteinase 1 (interstitial 
placental growth factor, vascular endoth 
tyrosine kinase with immunoglobulin and 
hematopoieticaliy expressed homeobox 
peroxlredoxin 1 

transmembrane protein (63kD), endoplasml 
cadherin 5, type 2, VE-cadherin (vascula 
caveoDn 1 , caveolae protein, 22kD 
hypothetical protein MGC10895 
ESTs 

protease, serine, 23 
ESTs 

KIAA0955 protein 

hypothetical protein DKFZp434G171 
B-ceO CLUiymphoma 6, member B (zinc fi 
phospholipase A2, group IVC (cytosolic, 
desmoplakin(DPI, DPIi) 
bromodomain-containing 4 
AD036 protein 

solute carrier family 7, (cation)c amino 
ESTs 

hypoftetical protein RJ10849 

Homo sapiens HSPC285 mRNA, partial cds 

E3 ubiquiiin iigase SMURF2 

hypothetical protein FU11110. 

SRY (sex determining region Y)-box 18 

Homo sapiens cDNA: FU22296 fts. done H 

Homo sapiens cDNA: FU21962 fis. clone It 
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106155 
106302 
106423 
106793 
107174 
107216 
107295 
1073S5 
108755 
108848 



109001 



10 



109456 
15 109768 
110107 
110905 
110984 
111006 
111018 
111133 
111760 
113073 
113195 
113923 
114521 
115061 
115096 
115145 
115819 
115947 
116314 
116339 
116430 
116589 
116733 
117023 
117186 
117563 
117997 
118475 
118581 
119073 
119155 
119174 
119221 
119416 
119866 
121335 
121381 
123160 
123473 
123523 
123533 
123964 
' 124006 
124315 
124659 



20 



25 



30 



35 



40 



45 



50 



55 



60 124847 
124875 
125091 
125103 
125355 

65 125565 
125590 
126511 
126563 



70 



75 



126872 
127402 
127651 
127759 
128062 



AA425309 
AA435896 
AA448238 
AA478778 
AA621714 
D51069 
T34527 
U97519 
AA127221 
AA132983 
AA135606 
AA156125 
AA179845 
AA232645 
F10399 
H16772 
N39584 
N52006 
N53375 
N54067 
N64436 
R26892 
T33637 
T57112 
W80763 
AA046808 
AA253217 
AA255991 " 
AA258138 
AA426573 
AA443793 
AA490588 
M496257 
AA609717 
D59570 
F13787 
H88157 
H98988 
N34237 
N52090 
N66845 
M68905 
R32894 
R61715 
R71234 
R98105 
T97186 
W80814 
AA404418 
AA405747 
AA488687 
AA599143 
AA608588 
AA608751 
C13961 
D60302 
H94892 
N93521 
N95477 
R60044 
R70506 
T91518 
T95333 
R45630 
R20839 
.R23858 
AI024874 
W26247 
AA856990 
AA136653 
AA358869 
AI123976 
AI369384 
AA379500 



129046 AA195678 



AA425414 Hs.33287 
AA398859 Hs.18397 
AB020722 Hs.16714 
H94997 Hs.16450 
BE122762 Hs.25338 
D51069 Hs.211579 
AA186629 Hs.80120 
NIVL005397HS.16426 
AA127221 Hs.117037 
AL1 17452 Hs.44155 
AA135606 Hs.189384 
AI056548 HS72116 
AA219691 Hs73625 
AW956580 Hs.42699 
F06838 Hs.14763 
AW151660 Hs.31444 
AA035211 Hs.17404 
AW613287 Hs.80120 
BE387014 Hs.166146 
A1287912 Hs.3628 
AW580939 Hs.97199 
BE551929 Hs.268754 
N39342 Hs.103042 
H83265 Hs.8881 
AW953484 Hs.3849 
AW139036 Hs.108957 
AI751438 Hs.41271 
AI683069 Hs.175319 
AA740907 Hs.88297 
AA486620 Hs.41135 
R47479 Hs.94761 
AI799104 Hs.178705 
AK000290 Hs.44033 
AK001531 Hs.66048 
AI557212 Hs.17132 
AL157424 Hs.61289 
AW070211 Hs.102415 
H98988 Hs.42612 
AF055634 Hs.44553 
N52090 Hs.47420 
N66845 
N68905 

BE245360 Hs.279477 
R61715 Hs.310598 
R71234 

C14322 Hs.250700 
T97186 

AA496205 Hs.193700 
AA40441B 

AW088642 Hs.97984 

AA488687 Hs.284235 

AA599143 

AA608588 

AA608751 

C13961 

AI147155 Hs.270016 
NM_005402Hs.288757 
AI680737 Hs.289068 
AI571594 Hs.102943 
W07701 Hs.304177 
AI887664 Hs.265814 
T91518 

AA570056 Hs.122730 
R60547 Hs.170098 
R20840 

R23858 Hs.143375 
T92143 Hs.57958 
AA516391 Hs.181368 
AA001860 Hs.279531 
AW450979 

AA358869 Hs.227949 
AA382523 Hs.105689 
AI369384 Hs.292441 
AA379621 Hs.105547 
H04150 Hs.107708 
AB029290 Hs.108258 



nuclear factor l/B 

hypothetical protein FU23221 

Rho guanine exchange factor (GEF) 15 

ESTs 

ESTs 

melanoma cell adhesion molecule 

UDP-N-acetyl-a!pha-I>galactosamIne:poryp 

podocalyxMe 

ESTs 

DKFZP586G1517 protein 

gb:zl10a05.s1 Soaresj3regnanLuterus_NbH 

hypoftetlcal protein FU20992 similar to 

RAB6 Interacting, kinesin-Bce (rabkines 

ESTs 

ESTs 

ESTs 

ESTs 

UDP-N-acetyl-alpha-l>galactDsamine:polyp 
Homer, neuronal Immediate early gene, 3 
mitogen-activated protein kinase kinase 
complement component C1q receptor . 
Homo sapiens cDNA FU1 1949 fis, clone HE 
microtubule-associated protein 1B 
ESTs, Weakly similar to S41044 chromosom 
hypothetical protein FU22041 simflar to 
40S ribosomal protein S27 isoform 
Homo sapiens mRNA full length insert cDN 
ESTs 
ESTs 

endomudn-2 

KIAA1691 protein / 

Homo sapiens cDNA FU1 1333 fis, clone PL 

dipeptidyl peptidase 8 

hypothetical protein FU10669 

ESTs, Moderately similar to I54374 gene 

synaptojanin 2 

Homo sapiens mRNA; cDNA DKFZp586N0121 {f 
ESTs, Weakly similar to ALU1.HUMAN ALU S 
unc5 (C.elegans homolog) c 
EST 

gb:za46c11.s1 Soares fetal liver spleen 
gb:za69b09.s1 Soares_fetaUung_NbHL19W 
ESTs 

ESTs, Moderately similar to ALU1.HUMAN A 
gb:yi54c08.s1 Soares placenta Nb2HP Homo 
tryptase beta 1 

gb:ye50h09.s1 Soares fetal liver spleen 
Homo sapiens mRNA; cDNA DKFZp586l0324 (f 
gb:zw37e02*1 SoaresJota!JetusJlb2HF8_ 
hypothetical protein FLJ22252 similar to 
ESTs, Weakly similar to I38022 hypotheti 
gb:ae52d04.s1 Stratagene lung carcinoma 
gb:ae54e06.s1 Stratagene lung carcinoma 
gb:ae56h07.s1 Stratagene lung carcinoma 
gb:C13961 Clontech human aorta polyA+mR 
ESTs 

v-ral simian leukemia viral oncogene horn 

Homo sapiens cDNA FU11918 fis, clone HE 

hypothetical protein MGC12916 

Homo sapiens clone FLB8503 PR02286 mRNA, 

sprouty (Drosophila) homotog 4 

gbrye20f05.s1 Stratagene lung (937210) H 

ESTs, Moderately similar to KIAA1215 pro 

KIAA0372gene product 

gb:yg05c08.r1 Soares infant brain 1N1B H 

Homo sapiens, clone IMAGE:3840937, mRNA, 

EGF-TM7-latrophBirMelated protein 

U5 snRNP-specific protein (220 kD), orth 

ESTs 

gb:U*Bl3-ala-*12-0-Ul.s1 NCLCGAP.Su 
SEC13(S. cerevlslae^nke 1 
MSTP031 protein 
ESTs 

neural proliferation, differentiation an 
ESTs 

actin binding protein; macrophin (mfcrof 
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129188 
129314 
129371 
129468 
129765 
129805 
129884 
130495 
130639 
130657 



M30257 
AA028131 
M10321 
J03040 



130972 
131080 
131137 
131182 
131486 
131573 
131647 
131756 
131859 
131881 
132050 
132083 
132164 
132358 
132413 
132456 
132490 
132676 
132687 
132718 
132736 
132760 
132933 
132968 
132994 
133061 
133147 
133161 
133200 
133260 
133363 
133491 
133517 
133550 
133607 
133614 
133627 
133691 
133698 
133913 
133975 
133985 
134039 
134Q88 
134161 
134299 
134416 
134453 
134656 
134989 
135051 
135073 
135349 
100114 
100130 
100143 
100168 
100208 
100224 
100405 
100420 
100455 
100529 
100618 
100619 



AA012933 

AA286710 

AA243278 

D59711 

T94452 

M053400 

AA370302 

J05008 

U85193 

AA256153 

X83107 

AA046593 

AA410480 

D45304 

M90657 

AA010163 

AA136353 

Y07867 

U84573 



AA132969 • 

AA114250 

F13782 

AA283035 

AB002301 

M056731 

U68019 

H99198 

AA598702 

N77151 

AA505133 

AB000584 

D12763 

AA253193 

AA432248 

AA083572 

AA479713 

U0395 

X52947 

W80846 

M34539 

D67029 

U09587 

M85289 

D10522 

W84712 

D29992 

L34657 

S78569 

D43636 

U97188 

AA487558 

M28882 

X70683 

X14787 

AA236324 

C15324 

AA452000 

D83174 

000596 

D11428 

D13640 

D14B74 

D26129 

D28476 

D86425 

D85983 

D87953 

HG1862-HT1897 
HG2614-HT2710 
HG2639+IT2735 



NWL001078HS.109225 
BE622768 Hs.290356 
X06828 Hs.1 10802 
AW41053B Hs.111779 
M86933 Hs.1238 
AA012848 Hs.12570 
AF055581 Hs.13131 
AW250380 Hs.109059 
AI557212 Hs.17132 
AW337575 Hs.201591 
AW631469 Hs.203213 
D81866 Hs.21739 
NM.Q01955Hs.2271 
W27392 Hs.33287 
AI824144 HS.23912 
F06972 Hs.27372 
AA040311 Hs.28959 
AA359615 Hs.30089 
AA443966 Hs.31595 
AW960564 
AW361018 Hs.3383 
AI267615 Hs.38022 
BE386490 Hs.279663 
AI752235 Hs.41270 
NMJX)3542Hs.46423 
AW361383 Hs.260116 
AB011084 Hs.48924 
NI\UK)1290Hs.4980 
N92589 Hs.261038 
AB002301 Hs.54985 
NM_004600Hs.554 
AW081883 Hs.211578 
AA125985 Hs.56145 
BE263252 Hs.6101 
AF234532 Hs.61638 
AA1 12748 Hs.279905 
AI186431 Hs.296638 
AA026533 Hs.66 
AW021103 Hs.6631 
AB037715 Hs.183639 
AA403045 Hs.6906 
AI866286 Hs.71962 
BE619053 Hs.170001 
NM_000165Hs.74471 
AI129903 Hs.74669 
BE273749 

NWL003003HS.75232 
NM_002047Hs.75280 
M85289 Hs.211573. 
AI878921 Hs.75607 
AU076964 Hs.7753 
C18356 Hs.295944 
L34657 Hs.78146 
N(VL002290Hs.78672 
AI379954 Hs.79025 
AA634543 Hs.79440 
AW580939 Hs.97199 
X68264 Hs.211579 
AI272141 Hs.83484 
AI750878 Hs.87409 
AW968058 Hs.92381 
AI272141 Hs.83484 
W55956 Hs.94030 
AA1 14212 Hs.9930 
X02308 Hs.82962 
NWL000304HS.103724 
AU076465 Hs.278441 
H73444 Hs.394 
NWL002933HS.78224 
AL121516 Hs.138617 
AW291587 Hs.82733 
D86983 Hs.118893 
AW888941 Hs.75789 
BE313693 Hs.334330 
AI752163 Hs.114599 
N24433 Hs.241567 



vascular cell adhesion molecule 1 
mesoderm development candidate 1 
von Wiljebrand factor 
secreted protein, acidic cysteine-rich 
amelogenln (Y chromosome) 
tubulin-specific chaperone d 



mitochondrial ribosomal protein L12 
ESTs, Moderately similar to 154374 gene 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586H518 (f 
endotheGn 1 
nuclear factor l/B 
ESTs 

BMX non-receptor iyroslne kinase 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamHy member 1 
upstream regulatory element binding prot 
ESTs 
Pirin 

procollagen-lysine, 2-oxoglirtarate 5-dio 
H4 histone family, member G 
metaltoprotease 1 (pitrilysin family) 
KIAA0512 gene product ALEX2 
UM domain binding 2 
ESTs, Weakly similar to 138022 hypotheti 
KIAA0303 protein 

Sjogren syndrome antigen A2 (60kD, ribon 
Homo sapiens cDNA: FU23037 fis, clone L 
thymosin, beta, identified in neuroblast 
hypothetical protein MGC3178 
myosin X 

clone HQ0310PRO0310p1 
prostate differentiation factor 
interleukin 1 receptor-like 1 
hypothetical protein FU20373 
hypothetical protein FU10210 
Homo sapiens cDNA: FU23197 fis, clone R 
ESTs, Weakly similar to B36298 proline-r 
eukaryotic translation initiation factor 
gap Junction protein, alpha 1, 43kD (con 
vesicle-associated membrane protein 5 (m 
FK506-binding protein 1A(12kD) 
SEC14(S.cerevisiae)-like1 
glycyl-tRNA synthetase 
heparan sulfate proteoglycan 2 (perlecan 
myristoylated alanine-rich protein kinas 
calumenln 

tissue factor pathway Inhibitor 2 
platelet/endothelial cell adhesion molec 
faminin, alpha 4 
KIAA0096 protein 
IGF-II mRNA-blnding protein 3 
complement component C1q receptor 
melanoma ceO adhesion molecule 
SRY (sex determining region Y>box 4 
thrombospondin 1 

nudix (nucleoside diphosphate linked moi 

SRY (sex determining region Y)-box 4 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

serine (or cysteine) proteinase tnhibito 

thymidylate synthetase 

peripheral myelin protein 22 

KIAA0015 gene product 

adrenomeduilin 

ribonuclease, RNase A family, 1 (pancrea 
thyroid hormone receptor interactor 12 
nidogen2 

Melanoma associated gene 

N-myc downstream regulated 

calmodulin 2 (phosphorylase kinase, delt 

collagen, type VIII, alpha 1 

RNA binding motif, single stranded inter 
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100668 
100676 
100718 
100752 
100828 
100850 
100991 
101097 
101110 
101142 
101156 
101168 
101184 
101192 
101317 
101336 
101345 
101400 
101475 
101485 
101496 
101505 
101543 
101557 
101560 
101587 
101592 
101633 
101634 
* 101667 
101682 
101714 
101720 
101741 
101744 
101793 
101837 
101838 
101840 
101857 
101864 
101931 
101966 
102012 
102013 
102024 
102059 
102121 
102283 
102300 
102378 
102395 
102460 
102491 
102499 
102523 
102560 
102564 
102589 
102600 
102645 
102687 
102693 
102709 
102759 
102804 



HG2B55^T2995 
HG3044-HT3742 
HG3342-HT3519 
HG3543W3739 
HG4069-HT4339 
HG417-HT417 
J03764 
L06797 
L08246 . 
L12711 
L13977 
L15388 
L19871 
L20859 
L42176 
L49169 
L76380 
M15990 
M23254 
M24736 
M26576 
M27396 
M31166 
M31994 
M32334 
M35878 
M36429 
M57730 
M57731 
M60858 
M62994 
M68874 
M69043 
M74719 
M75126 
M84349 
M92643 
M92934 
M93056 
M94856 
M95787 
S76965 
S81914 
U03057 
U03100 
U03877 
U08021 
U14391 
U31384 
U32944 
U40369 
U41767 
U48959 
U51010 
U51478 
U53445 



U56725 
X02761 



Hs.180414 
Hs.287820 



102907 
102915 
102927 
102960 
103011 
103020 
103029 
103036 
103056 



U59423 
1)62015 
U63825 
U67963 
U73379 
U73824 
U77604 
U81607 
U89942 
X04412 
X06985 
X07820 
X12876 
X15729 
X52541 
X53416 
X54489 
X54925 
X57206 



BE295928 Hs75424 
T81309 

AL048753 Hs.303649 
AA836472 Hs.297939 
J03836 Hs.82085 
BE245301 Hs.89414 
A1439011 Hs.86386 
L12711 Hs.89643 
AA340987 Hs.75693 
NMJ05308HS.211569 
NWL001674HS.460 
BE247295 Hs.78452 
L42176 Hs.8302 
NM.006732HS.75678 
NM_005795Hs.152175 
M15990 Hs.194148 
BE410405 Hs.76288 
AA296520 Hs.89546 
X12784 Hs.1 19129 
AA307680 Hs.75692 
M31166 Hs.2050 
BE293116 Hs.76392 
AW958272 Hs.347326 
AI752416 Hs.77326 
AF064853 Hs.91299 
NMJ»4428Hs.1624 
AV650262 Hs.75765 
NM.005381 
AF043045 Hs.81008 
M68874 Hs.211587 
M69043 Hs.81328 
NM-003199HS.326198 
AI879352 Hs.1 18625 



W01076 
M92843 



Hs278573 
Hs.343586 



BE243845 Hs.75511 
AA236291 Hs.183583 
BE550723 Hs.153179 
BE392588 Hs.75777 
NM.006823HS.75209 
X96438 Hs.76095 
BE259035 Hs.1 18400 
BE616287 Hs.178452 
AA301867 Hs.76224. 
Ar752668 Hs.76669 
NM_004998Hs.82251 
AW161552 Hs.83381 
AI929721 Hs.5120 
AU076887 Hs.28491 
AU077005 Hs.92208 
U43959 Hs.211582 
U51010 

BE243877 Hs.76941 
U53445 Hs.15432 
R97457 Hs.63984 
U59423 Hs.79067 
AU076728 Hs.8867 
AI984144 Hs.66713 
AL1 19566 Hs.6721 
. NM_007019Hs.93002 
AA532780 Hs.183684 
AA122237 Hs.81874 
NAl.005100Hs.788 
NM.002318HS.83354 
A1767736 Hs.290070 



BE409861 
X07820 



Hs.202833 
Hs.2258 



BE512730 Hs.65114 
AI904738 Hs.76053 



heat shock 70kD protein 2 
fibronectin 1 

inhibitor of DNA binding 1, dominant neg 
insuIMke growth factor 2 (somatomedi 
small inducible cytokine A2 (monocyte ch 
cathepsin 8 

serine (or cysteine) proteinase inhiblto 
chemokine (C-X-C motif), receptor 4 (fus 
myeloid ceil leukemia sequence 1 (BCL2-f 
transketolase (Wernicke-Korsakoff syndro 
proiylcarboxypeptidase (angiotensinase C 
G protein-coupled receptor kinase 5 
activating transcription factor 3 
solute carrier family 20 (phosphate tran 
four and a half LIM domains 2 
FBJ murine osteosarcoma viral oncogene h 
calcitonin receptortike 
v-yes-1 Yamaguchi sarcoma viral oncogene 
calpain 2, (m/ll) targe subunit 
selectin E (endothelial adhesion molecut 
collagen, type IV, alpha 1 
asparagine synthetase 
pentaxin-related gene, rapidly induced b 
aldehyde dehydrogenase 1 family, member 
intercellular adhesion molecule 2 
insutin-Dke growth factor binding prote 
guanine nucleotide binding protein (G pr 
ephrin-A1 
GR02 oncogene 
nucteolin 

filamin B, beta (actin-binding protein-2 
phosphoiipase A2, group IVA (cytosoiic, 
nuclear factor of kappa light polypeptld 
transcription factor 4 
hexokinase 1 

CD59 antigen p18-20 (antigen identified 
zinc finger protein homologous to Zfp-36 
connective tissue growth factor 
serine (or cysteine) proteinase Inhiblto 
fatty acid binding protein 5 (psoriasis- 



AJ243425 
X53416 



Hs.326035 
Hs.195464 



AW800726 Hs.789 
M13509 Hs.83169 
Y18024 Hs.78877 



protein kinase (cAMP-dependent, catalyti 
immediate early response 3 
singed (Drosophila)-like (sea urchin fas 
catenin (cadherin-associateb protein), a 
EGF-containing fibulMceextracellula 
nicotinamide N-methyltransferase 
myosin IE 

guanine nucleotide binding protein 11 
dynein, cytoplasmic, light polypeptide 
spermidine/spermine N1-acety [transferase 
a disintegrin and metailoprotetnase doma 
myosin, light polypeptide kinase 
gb:Human nicotinamide N-methyltransferas 
ATPase, Na*yK+ transport!^, beta 3 poly 
downregulated in ovarian cancer 1 
cadherin 13, H-cadherin (heart) 
MAO (mofiiers against decapentaplegic, Dr 
cysteine-rich, angiogenic inducer, 61 
hepatitis delta antigen-interacting prot 
lysosomal 

ubiquitin carrier protein E2-C 
eukaryotic translation initiation factor 
microsomal glutathione S-transferase 2 
A kinase (PRKA) anchor protein (gravin) 
lysyl oxidase-like 2 
gelsoDn (amyloidosis, Finnish type) 
heme oxygenase (decycling) 1 
matrix metalioprotehase 10 (stromelystn 
keratin 18 

DEAD/H (Asp-Glu-Ala-Asp/His) box porypep 
early growth response 1 
filamin A, alpha (actin-binding protein- 
GR01 oncogene (melanoma growth sfamulati 
matrix metaJioproteinase 1 (interstitial 
inositol 1 ,4,5-trisphosphate 3-kinase B 
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103080 
103095 
103138 
103176 
103195 
103347 
103371 
103432 
103471 
103987 
104447 
104764 
104783 
104798 
104865 
104877 
104894 
104952 
105113 
105178 
105196 
105215 
105263 
105271 
105330 
105461 
105492 
105493 
105594 
105727 
105732 
105767 
105882 



106031 
106124 
108222 
106241 
106263 
105264 
106366 
106454 
106634 
106724 
106793 
106799 
106842 
106868 
106890 
106961 
106974 
107030 
107061 
107086 
107216 
107385 
107444 
107985 
108507 
108695 
108931 
109001 
109195 
109390 
109456 
109737 
110411 
110660 
110906 
11101B 
111091 
111356 
111378 
111741 
111769 
112318 



X59798 

X60957 

X65965 

X69111 

X70940 

X87838 

X91247 

X97748 

Y00815 

AA303711 

L44538 

AA025351 

AA027050 

AA029462 

AA045136 

AA047437 

M054087 

M071089 

AA156450 

AA187490 

AA195031 

AA205724 

AA227926 

AA227986 

AA234743 

AA253216 

AA256210 

AA256268 

AA279397 

AA292379 

AA292717 

AA346551 

AA400292 

M404338 

AA412284 

AA423987 

AA428594 

AA430108 

AA431462 

AA431470 

AA443756 

AA449479 

AA459916 

AA465226 

AA478778 

AA479037 

AM82597 

AA487561 

AA489245 

AA504110 

AA520989 

AA599434 

AA608649 

AA609519 

D51069 

U97519 

W28391 

AA035638 

AA083514 

AA121315 

AA147186 

AA156125 

AA186932 

AA219653 

AA232645 

F1007B 

H48032 

H82117 

N39584 

N54067 



N90933 
N93764 
R26124 
R27957 
R55470 



AU077231 Hs.82932 
NM_005424Hs78824 
X65965 

AL021154 Hs.76884 
AA351647 Hs.2642 
AU077309 Hs.171271 
X91247 Hs.13046 
X97748 

Y00815 Hs.75216 
AL120051 Hs.144700 
AW204145 Hs.156044 
AI039243 HS278585 
AA533513 Hs.93659 
AW952619 Hs.17235 
T79340 Hs.22575 
AI138635 Hs.22966 
AF065214 Hs.18858 
AW076098 Hs.345588 
AB037816 Hs.8982 
AA313825 Hs.21941 
W84893 Hs.9305 
AA205759 Hs.10119 
AW388633 Hs.6682 
AA807881 Hs.25329 
AW338625 Hs.22120 
BE539071 Hs.69388 
AI805717 Hs.289112 
AL047586 Hs.10283 
AB024334 Hs.25001 
AL135159 Hs.20340 
AW504170 Hs.274344 
AW370946 Hs.23457 
W46802 Hs.81988 
AI678765 Hs.21812 
X64116 Hs.171844 
H93366 Hs.7567 
AA356392 Hs.21321 
BE019681 Hs.6019 
W21493 Hs.28329 
AL046859 Hs.3407 
AA186715 Hs.336429 
NM.014038Hs.5216 
W25491 Hs.288909 
N48670 Hs.28631 
H94997 Hs.16450 
BE313412 Hs.7961 
AF124251 Hs.26054 
BE185536 Hs.301183 
AA489245 Hs.88500 
AW243614 Hs.18063 
AI817130 Hs.9195 
AL117424 Hs.25035 
BE147611 Hs.6354 
NM_012331Hs^6458 
D51069 Hs.211579 
NM_005397Hs.16426 
W28391 Hs.343258 
T40064 Hs.71968 
AI554545 Hs.68301 
AB029000 Hs.70823 
AA147186 

AI056548 Hs72116 
AF047033 Hs.132904 
AW007485 Hs.87125 
AW956580 Hs.42699 
AA055415 Hs.13233 
AW001579 Hs.9645 
AA782114 Hs.28043 
AA035211 Hs.17404 
AI287912 Hs.3628 
AA300067 Hs.33032 
BE301871 Hs.4867 
AW160993 Hs.326292 
AB020653 Hs.24024 
AW629414 Hs.24230 
AW083384 Hs.11067 



cycOn D1 (PRAD1: parathyroid adenomatos 

tyrosine kinase with Immunoglobulin and 

gb:H.sapiens SOD-2 gene for manganese su 

inhibitor of DNA binding 3, dominant neg 

eukaryotic translation elongation factor 

catenin (cadherin -associated protein), b 

thioredoxtn reductase 1 

gb:H.sapiens PTX3 gene promotor region. 

protein tyrosine phosphatase, receptor t 

ephrin-B1 

ESTs 

ESTs 

protein disulfide isomerase related prot 

Homo sapiens clone TCCC1A00176 mRNA sequ 

B-cell CLL/Iymphoma 6, member B (zinc fi 

Homo sapiens clone IMAGE:451939, mRNA se 

phospholipase A2, group IVC (cytosolic, 

desmopiakin (DPI, DPII) 

Homo sapiens, clone IMAGE:3506202, mRNA, 

AD036 protein 

angiotensin receptor-like 1 

hypothetical protein FLJ14957 

solute carrier family 7, (cationlc amino 

ESTs 

ESTs 

hypothetical protein FLJ20505 

CGI-43 protein 

RNA binding motif protein 8B 

tyrosine 3-monooxygenase/tryptophan 5-mo 

KIAA1002 protein 

hypothetical protein MGC12942 

ESTs 

disabled (DrosophQa) homolog 2 (mitogen 
ESTs 

Homo sapiens cDNA; FU22296 fis, clone H 

Homo sapiens cDNA: FU21962 fis, clone H 

Homo sapiens clone FLB9213 PR02474 mRNA, 

Homo sapiens cDNA: FLJ21288 fis, clone C 

hypothetical protein FU14005 

protein kinase (cAMP-dependent, catalyti 

RIKEN cDNA 9130422N19 gene 

HSPC028 protein 

hypothetical protein FLJ22471 

Homo sapiens cDNA: FU22141 fis, clone H 

ESTs ' 

Homo sapiens clone 25012 mRNA sequence 
novel SH2-containing protein 3 
molecule possessing ankyrin repeats indu 
mitogen-activated protein kinase 8 inter 
Homo sapiens cDNA FU10768 fis, done NT 
Homo sapiens cDNA FU1 3698 fis, clone PL 
chloride Intracellular channel 4 
stromal cell derived factor receptor 1 
methionine sulfoxide reductase A 
melanoma ceil adhesion molecule 
podocalyxin-like 

proliferation-associated 2G4, 38kD 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 

ESTs 

KIAA1077 protein 

gb:zo38d01.s1 Stratagene endothelial eel 
hypothetical protein RJ20992 similar to 
solute carrier family 4, sodium bicarbon 
EH-domain containing 3 
ESTs 

ESTs, Moderately similar to A47582 B-cel 
Homo sapiens mRNA forKIAA1741 protein, 
ESTs 
ESTs 

mitogen-activated protein kinase kinase 
hypothetical protein DKFZp434N185 
mannosyl (aljpha-13->grycoprotein beta- 
hypothetical gene DKFZp434A11 14 
KIAA0846 protein 
ESTs 

ESTs, Highly similar to T46395 hypotheti 



124 



WO 02/079492 



PCTYUS02/04915 



112951 T16550 

113057 T26674 

113195 T57112 

113490 T83700 

5 113542 T90527 

113803 W42789 

113847 W60002 

113910 W78175 

113947 W84768 

10 114047 W94427 

115061 AA253217 

115819 M426573 

115870 AA432374 

115964 AA446622 

15 . 116228 AA478771 

116264 M482594 

116314 AA490588 

116589 D59570 

117023 H88157 

20 117112 H94648 

117156 H97538 

117176 H98670 

117280 W22107 

119559 W38197 

25 119B66 W80814 

120655 AA287347 

121314 AA402799 

121335 AA404418 

121822 AA425107 

30 121835 AA425435 

122331 AA442872 

122577 AA452860 

123160 AA4B8687 

123486 AA599574 

35 124059 F13673 

124339 H99093 

124358 N22495 

• 124364 N23031 

124726 R15740 

40 124763 R39610 

125167 W45560 

125304 Z39833 

125307 Z40583 

125329 AA825437 

45 . 125598 R66613 

125509 AA868063 

418245 AA128075 

127435 N66570 

127556 AI051390 

50 127619 AA627122 

128453 X02761 

128495 AF010193 

128515 AA149044 

128580 U82108 

55 128623 D78676 

128642 L35240 

128669 AA598737 

128903 R69417 

128914 AA232837 

60 129087 N72695 

129188 M30257 

129226 M96843 

129265 X68277 

129345 AA292440 

65 129468 J03040 

129488 AA228107 

129498 AA449789 

129557 W01367 

129619 AA610116 

70 129627 AA258308 

129762 AA460273 

129884 AA286710 
130018 T68873 

130147 D63476 

75 130178 M62403 

130282 X55740 



AA307634 Hs.6650 
AW194301 Hs.339283 
H83265 Hs.8881 
BE178110 Hs.173374 
H43374 Hs.7890 
AW880709 Hs.283683 
NM_005032Hs.4114 
AA113262 Hs.17901 
W84768 

AL035858 Hs.3807 
AI751438 Hs.41271 
AA486620 Hs.41135 
NrVL005985Hs.48029 
AA987568 Hs.74313 
AI767947 Hs.50841 
D51174 Hs.272239 
AI799104 Hs.178705 
AI557212 Hs.17132 
AW070211 Hs.102415 
AW969999 Hs.293658 
W73853 

H45100 Hs.49753 
M18217 .Hs.172129 
W38197 

AA496205 Hs.193700 
AA305599 Hs.238205 
W07343 Hs.182538 
AA404418 
AI743860 

AB033030 Hs.300670 
AL133437 Hs.1 10771 
AA829725 Hs.334437 
AA4B8687 Hs.284235 
BE019072 Hs.334802 
BE387335 Hs.283713 
H99093 Hs.343411 
AW070211 Hs.102415 
AF265555 Hs.250646 
NWL003654HS.104576 
BE410405 HSJ6288 
AL137540 Hs.102541 
AL359573 Hs.124940 
AW580945 Hs.330466 
AA825437 Hs.58875 
T40064 Hs.71968 
AA8SB063 Hs.104576 
AA088767 Hs.83883 
X69086 Hs.286161 
AI051390 Hs.116731 
AA627122 Hs.163787 
X02761 Hs.287820 
NIVL005904HS.100602 
BE395085 Hs.10086 
U82108 Hs.101813 
BE076608 Hs.105509 
Z28913 Hs.102948 
W28493 Hs.180414 
AW150717 Hs.345728 
AW867491 Hs.107125 
AI348027 Hs.108557 
NIVL001078HS.109225 
BE222494 Hs.180919 
AA530892 Hs.171695 
R22497 Hs.1 10571 
AW410538 Hs.1 11779 
AW966728 Hs.54642 
AA449789 Hs.75511 
AL045404 Hs.46366 
AA209534 Hs.284243 
T40064 Hs.71968 
AA453694 Hs.1 2372 
AF055581 Hs.13131 
AA353093 

D63476 Hs.1 72813 
U20982 Hs.1516 
BE245380 Hs.153952 



vacuolar protein sorting 45B (yeast homo 
Human DNA sequence from clone RP1-1 87J1 1 
ESTs, Weakly similar to S41044 chromosom 
Homo sapiens cDNA FU1Q500 fis, clone NT 
Homo sapiens mRNA for KIAA1671 protein, 
chromosome 8 open reading frame 4 
plastin 3 (T Isofomi) 

Homo sapiens, done IMAGE:3937015, mRNA, 
gb:zh53d03.s1 Soares_fe!aUver_spieen_ 
FXYD domairMxmtaining ton transport reg 
Homo sapiens mRNA full length Insert cDN 
endomucin-2 

snail 1 (drosophlla homotog), zincfinge 

KIAA1265 protein 

ESTs 

lysosomal 

Homo sapiens cDNA FIJ1 1333 As, done PL 
ESTs, Moderately similar to I54374 gene 
Homo sapiens mRNA; cDNADKFZp586N0121 (f 
ESTs 
ESTs 

uvea) autoantigen with coiled coil domai 
Homo sapiens cDNA: FU21409 fis, clone C 
Empirically selected from AFFX single pr 
Homo sapiens mRNA; cDNA DKFZp586l0324 (f 
hypothetical protein PRO2013 
phospholipid scrambtase 4 
gb:zw37e02.s1 SoaresJotaLfetus_Nb2HF8._ 
metalbthioneln 1E (functional) 
KIAA1204 protein 

Homo sapiens cDNA: FLJ21904 fis, done H 
hypothetical protein MGC4248 
ESTs, Weakly similar to I38022 hypotheti 
Homo sapiens cDNA FU14680 fis, clone NT 
ESTs, Weakly similar to S64054 hypotheti 
DEAD/H (Asp-GIu^la-Asp/His) box polypep 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
baculoviral IAP repeat-containing 6 
carbohydrate (keratan sulfate Gai-6) sul 
calpain 2, (m/Il) large subuntt 
netrin 4 

GTP-binding protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
carbohydrate (keratan sulfate Gal-6) sul 
transmembrane, prostate androgen induced 
Homo sapiens cDNA FU13613 fis, done PL 
ESTs 
ESTS 

fibronectin 1 

MAD (mothers against decapehtaplegic Dr 
type I transmembrane protein Fn14 
solute carrier family 9 (sodium/hydrogen 
CTL2gene 

enigma (LIM domain protein) 
heat shock 70kD protein 8 
STAT Induced STAT inhibitor 3 
plasmalemma veside assodated protein 
hypothetical protein PP1057 
vascular cell adhesion molecule 1 
inhibitor of DNA binding 2, dominant neg 
dual specificity phosphatase 1 
growth arrest and DlW^amage-taducible, 
secreted protein, addic, cysteine-rich 
methionine adenosyltransferase II, beta 
connective tissue growth factor 
KIAA0948 protein 
tetraspan NET-6 protein 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
tripartite motif protein TR1M2 
lysosomal 
metallothionein 1L 

PAK-interacting exchange factor beta 
insuDn-fflce growth factor-binding prote 
S'nudeotidase^) 
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130431 
130495 
130553 
130638 
130639 
130657 
130686 
130776 
130818 
130840 
130899 
131002 
131080 
131084 
131091 
131107 
131182 
131207 
131319 
131328 
131328 
131328 
131328 
131509 
131555 
131564 
131573 
131692 
131756 
131859 
131909 
131915 
132046 
132050 
132151 
132164 
132187 
132303 
132314 
132358 
132398 
132421 
132490 
132520 
132546 
132610 
132716 
132840 
132883 
132968 
132989 
132999 
133071 
133076 
133099 
133147 
133149 
133161 
133200 
133220 
133260 
133295 
133349 
133391 
133398 
133436 
133454 
133478 
133491 
133510 
133517 
133526 
133538 
133562 
133584 
133590 



L10284 

AA243278 

AA430032 

H16402 

D59711 

T94452 

AA431571 

R79356 

AA280375 



Z41740 

AA121543 

J05008 

AA101878 

T35341 

N87590 

AA256153 

W74533 

U25997 

V01512 

V01512 

V01512 

V01512 

X56681 

AA161292 

AA491465 



D50914 

D45304 

M90657 

W69127 

AA316186 

AA384503 

AA1 36353 

AA044755 

U84573 

AA05B911 

AA620962 

AA285290 

X60486 

R31641 

AA489190 

F13782 

AA257993 

M24283 

AA443114 

T35289 

N23817 

AA047151 

N77151 

AA480074 

Y00787 

T99789 

W84341 

L09209 

D12763 

T16484 

AA253193 

AA432248 

X82200 - 

AA083572 

L00352 

N75791 

X57579 

X02612 

H44631 

AA090257 

X83703 

L40395 

AA227913 

X52947 

M11313 

L14837 

M60721 



AW505214 H3.155560 
AW250380 Hs.109059 
AF062649 Hs.252587 
AW021276 Hs.17121 
AI557212 Hs.17132 
AW337575 Hs.201591 
BE548267 Hs.337986 
AF167706 Hs.19280 
AW190920 Hs.19928 
BE048821 Hs.20144 
AI077288 Hs.296323 
AL050295 Hs.22039 
NM_001955Hs.2271 
NM017413HS.303084 
AJ271216 Hs.22880 
BE620886 Hs.75354 
AI824144 Hs.23912 
AF104266 Hs.24212 
NNL003155HS.25590 
AW939251 Hs.25647 
AW939251 Hs.25647 
AW939251 Hs.25647 
AW939251 Hs.25647 
X56681 Hs.2780 
T47364 Hs.278613 
T93500 Hs.28792 
AA040311 Hs.28959 
BE559681 Hs.30736 
AA443966 Hs.31595 
AW960564 

NWL016558HS.274411 
A1161383 Hs.34549 



AI359214 
AI267615 
BE379499 
AI752235 
AA235709 
BE177330 
AF1 12222 



Hs.179260 

Hs.38022 

Hs.173705 

Hs.41270 

Hs.4193 

Hs.325093 

Hs.323805 



NMJ)03542Hs.46423 
AA876616 Hs.16979 
AW163483 Hs.48320 
NNL001290HS.4980 
AA257992 Hs.50651 



M24283 

AA160511 

BE379595 

BE218319 

AA373314 

AF234532 



Hs.1 68383 

Hs.5326 

Hs.283738 

Hs.5807 

Hs.5897 

Hs.61638 



T67986 



AA480074 Hs.331328 
Y007B7 Hs.624 
BE384932 Hs.64313 
AW946276 Hs.6441 
W16518 Hs.279518 
AA026533.Hs.66 
AA370045 Hs.6507 
AW021103 Hs.6631 
AB037715 Hs.183639 
NMJ)06074Hs.318501 
AA403045 Hs.6906 
AI147861 Hs.213289 
AW631255 Hs.8110 
AW103364 Hs.727 
N1VL000499HS.72912 
BE294068 Hs.737 
BE547647 Hs.177781 
X83703 Hs.31432 
BE619053 Hs.170001 
AW880841 Hs.96908 
NMJ00165Hs.74471 
AU077051 Hs.74561 
NIVL003257HS.74614 
M60721 Hs.74870 
D90209 Hs.181243 
T70956 Hs.75106 



calnexin 

mitochondrial ribosomal protein L12 
pituitary tumor-transforming 1 
ESTs 

ESTs, Moderately similar to I54374 gene 
ESTs 

Homo sapiens cDNA FU10934 fis, cione OV 

cysteine-rich motor neuron 1 

hypothetical protein SP329 

small inducible cytokine subfamily A (Cy 

serum/glucocorticold regulated kinase 

KIAA0758 protein 

endothefin 1 

apelin; peptide Ggand for APJ receptor 

dipeptidyipeptidase ill 

GCN1 (general control of amino-acid synt 

ESTs 

latrophlfin 

stanniocalcln 1 

v-fos FBJ murine bsteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
jun D proto-oncogene 
Interferon, alpha-inducible protein 27 
Homo sapiens cDNA FU11041 fis, clone PL 
ESTs 

KIAA0124 protein 
ESTs 

transmembrane 4 superfamify member 1 

SCAN domairHX>ntarning 1 

ESTs, Highly similar to S94541 1 clone 4 

chromosome 14 open reading frame 4 

ESTs 

Homo sapiens cDNA: FU22050 fis, clone H 

procollagen-lysine, 2-oxogiutarate 5-dio 

DKFZP58601624 protein 

Homo sapiens cDNA: FU21210 fis, clone C 

pinin, desmosome associated protein 

H4 htetone family, member G 

ESTs, Weakly similar to A43932 mucin 2 p 

double ring-finger protein, Dorfin 

LIM domain binding 2 

Janus kinase 1 (a protein tyrosine kinas 

intercellular adhesion molecule 1 (CD54) 

amino acid system N transporter 2; porcu 

casein kinase 1, alpha 1 

GTPaseRab14 

Homo sapiens mRNA; cDNA DKFZp586P1622 (f 
myosin X 

hypothetical protein FU13213 
Interleukin 8 

ESTs, Weakly similar to AF257182 1 G-pro 

Homo sapiens mRNA; cDNA DKFZp586J021 (fr 

amyloid beta (A4) precursor-like protein 

Interleukin 1 receptor-tike 1 

AXIN1 up-regulated 

hypothetical protein FU20373 

hypothetical protein FU10210 

Homo sapiens mRNA full length insert cDN 

Homo sapiens cDNA: FU23197 fis, clone R 

low density lipoprotein receptor (famBi 

L-3-hydroxyacyl-Coenzyme A dehydrogenase 

Inhibin, beta A (activin A, acrjvin AB a 

cytochrome P450, subfamily I (aromatic c 

immediate early protein 

hypothetical protein MGC5618 

cardiac ankyrln repeat protein 

eukaryotJc translation initiation factor 

p53-lnduced protein 

gap junction protein, alpha 1, 43kD (con 

alpha-2-macroglobulin 

tight junction protein 1 (zona occluders 

H2.0 (Drosophiia)-like homeo box 1 

activating transcription factor 4 (tax-r 

crusterin (complement lysis inhibitor, S 



126 



WO 02/079492 



PCTYUS02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



133617 
133651 
133671 
133678 
133681 
133722 
133730 
133750 
133802 
133825 
133838 
133859 



133960 
133975 
133977 
134039 
134075 
134081 
134164 
134203 
134238 
134299 
134332 
134339 
134343 
134381 
134403 
134416 
134493 
134558 
134817 
134983 
134989 
135052 
135062 
135069 
135071 
135073 
135170 
135195 
135348 
134404 
439561 
100082 
132817 
130150 
100104 
447973 
332613 
100113 
133980 
100129 
100154 
100169 
129718 
100190 
134742 
100211 
100238 
130283 
134237 
100248 
100256 
100262 
134329 
100281 
100294 
100327 
100335 
134495 
100338 
135152 
-134269 
100372 
134304 



AA148318 

U97105 

T25747 

K02574 

D78577 

X53331 

S73591 

X95735 

L16862 

U44975 

M97796 

U86782 

AA099391 

M19267 

D29992 

L19314 

S78569 

U28811 

L77886 

C14407 

M60278 

R81509 

AA487558 

D86962 

M478971 

D50683 

U56637 

M51199 

M28882 

X15183 

S53911 

U20734 

D28235 

AA236324 

AA1 48923 

AA174183 

AA456311 

L08069 

M452000 

AA282140 

J02854 

AA442054 

AB000450 

AB002380 

AB003103 

AB004884 

AF000573 

AF008937 

AF009301 

AF009368 

D00591 

D00760 

D11139 

D14657 

D14878 

D17716 

D21090 

D26135 

D26528 

D30742 

D31762 

D31765 

031888 

D3B128 

D3B500 

D38551 

D42087 



D55640 
D63391 
D63477 
D63483 
D64015 
D79990 
D79997 
D80010 



BE244334 Hs.75249 
AI301740 Hs.173381 
AW503116 Hs.301819 
AW247252 
AI352558 

AW969976 Hs.279009 
BE242779 Hs.179526 
BE410769 HS75873 
AW239400 HS76297 
BE616902 Hs.285313 
BE222494 Hs.180919 
U86782 Hs.178761 



U48959 
M19267 
C18356 



Hs.211582 

Hs.77899 

Hs.295944 



AI125639 Hs.250666 
NM_002290HsJ8672 
NIVL012201HS78979 
AL034349 Hs.79005 
AW245540 Hs79516 
AA161219 Hs799 
AA102179 Hs.160726 
AW580939 Hs.97199 
D86962 Hs.81875 



R70429 
D50683 



Hs.81988 
Hs.82028 



AI557280 Hs.184270 
AA334551 



X68264 
M30627 



Hs.211579 
Hs.289088 



NM_001773Hs.85289 
AU076592 Hs.198951 
028235 Hs.1 96384 
AW968058 Hs.92381 
AL136653 Hs.93675 
AK000967 Hs.93872 
AA876372 Hs.93961 
W27190 Hs.94 



W55956 
T53169 
C03577 
U80983 



Hs.94030 
Hs.9587 
Hs.9615 
Hs.268177 



AB000450 Hs.82771 
AF180681 Hs.6582 
AA130080 Hs.4295 
N27852 Hs.57553 
BE094848 Hs.15113 
AF008937 

AB011169 Hs.20141 
AF029674 Hs.173422 
NIVL001269HS.84746 
AA294921 Hs.348024 
AA469369 Hs.5831 
H60720 Hs.81892 
AL037228 Hs.82043 
NWL002410HS.1 21502 
M91401 Hs.178658 
NM_001346Hs.89462 
D26528 Ms.123058 
L24959 Hs.348 
NM_012288Hs.153954 
D31765 Hs.170114 
NM_015156Hs.78398 
D25418 Hs.393 
D38500 Hs.278468 
N92036 Hs.81848 
AF091035 Hs.184627 
AA331881 Hs.75454 
D55640 

AW247529 Hs.6793 
D63477 Hs.84087 
D86864 Hs.57735 
M95954 Hs.182741 
NMJ14737HS.80905 
NM.014791HS.184339 
BE613486 Hs.81412 



ADP-ribosylation factor-like 6 interact} 

d5iydropyrimidinas&-fike 2 

zinc finger protein 146 

nucleoside phosphorylase 

tyrosine 3-monooxygenase/tryptophan 5-mo 

matrix Gla protein 

upregulated by 1,25-dihydroxyvitamin D-3 
zyxln 

G protein-coupled reoeptor kinase 6 

core promoter element binding protein 

inhibitor of DNA binding 2, dominant neg 

26S proteasome-associated pad1 homoiog 

myosin, light polypeptide kinase 

tropomyosin 1 (alpha) 

tissue factor pathway inhibitor 2 

hairy (Drosophila)-homolog 

lamlnin, alpha 4 

Golgi apparatus protein 1 

protein tyrosine phosphatase, receptor t 

brain abundant, membrane attached signal 

diphtheria toxin receptor (heparin-bindl 

Homo sapiens cDNA FU11680 fis, clone HE 

complement component C1q receptor 

growth factor receptor-bound protein 10 

disabled (Orosophlla) homoiog 2 (mitogen 

transforming growth factor, beta recepto 

capping protein (actin filament) muscle 

sperm specific antigen 2 

melanoma cell adhesion molecule 

heat shock 90kD protein 1 , alpha 

CD34 antigen 

)un B proto-oncogene 

prostaglandin-endoperoxide synthase 2 (p 

nudix (nucleoside diphosphate linked moi 

decidual protein induced by progesterone 

KIAA1682 protein 

Homo sapiens mRNA; cDNA DKFZp667D095 (fr 
DnaJ (Hsp40) homoiog, subfamily A, membe 
Homo sapiens mRNA; cDNA DKFZp586E1624 (f 
Homo sapiens cDNA: FU22290 fis, clone H 
myosin regulatory light chain 2, smooth 
phospholipase C, gamma 1 (formerly subty 
vaccinia related kinase 2 
Rho guanine exchange factor (GEF) 1 2 
proteasome (prosome, macropain) 26S subu 
tousled-like kinase 2 

homogentisate 1,2-dioxygenase (homogenti 
syntaxin 16 

similar to S. cerevisiae SSM4 

KIAA1605 protein 

chromosome condensation 1 . 

v-ral simian leukemia viral oncogene horn 

tissue Inhibitor of metalloproteinase 1 

KIAA0101 gene product 

D123gene product 

mannosyl (alpha-1,6-)*glycoprotein beta- 
RAD23 (S. cerevisiae) homoiog B 
diacylglycerol kinase, gamma (90kO) 
DEAO/H (Asp-Glu-Ala-Asp/His) box polypep 
calcium/calmodulin-dependent protein kin 
TRAM-like protein 
KIAA0061 protein 
KIAA0071 protein 

prostaglandin 12 (prostacycfin) receptor 
postmeiotic segregation increased 2-fike 
RAD21 (S. pombe) homoiog 
KIAA01 18 protein 
peroxiredoxin 3 

gbiHuman monocyte PABL (pseudoautosomal 

platelet-activating factor acetylhydrola 

KIAA0143 protein 

acetyl LDL receptor, SREC 

TIA1 t^toxic granule-associated RNA-bi 

Ras association (RalGDS/AF-6) domain fam 

K1AA0175 gene product 

Dpinl 
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100394 
100405 
100418 
133154 
134347 
444099 
100438 
134593 
100481 
100552 
100591 
100652 
100662 



100905 
100945 
100950 
100964 
135407 
130149 
131877 
101016 
134786 
134100 
134078 
101132 
134849 
332736 
101152 
135397 
432642 
101168 
421155 
101226 
415138 
134739 
130155 
440538 
409916 
101294 
101300 
101310 
130344 
101381 
101381 
415678 
133780 
101395 
101447 
101458 
101470 
134604 
101478 
133519 
131185 
134116 
133999 
130174 
129963 
132983 
133900 
101543 
101545 
101620 
134691 
133595 
101700 
101714 
134246 
101760 
415022 
415022 
415022 
415022 
415022 
101791 



D84276 
D86425 
D86978 
D87012 
D87075 
D87432 
087448 
087845 

HG1098-HT1098 

HG2167-HT2237 

HG2415-HT2511 

HG28254TT2949 

HG2887-HT3031 

HG4660-HT5073 

HG4704-HT5146 

HG884-HT884 

HG919-HT919 

J00212 

J04029 

J04031 

J04088 

J04543 

L06139 

L07540 



L11239 

L11353 

L13773 

L13800 

L14922 

L15189 

L15388 

L16895 

L27476 

L27624 

L32976 

L33404 

L35263 

L37347 

L40371 

L40391 

L41607 

L77566 

M13928 

M13928 

M14016 

M14219 

M15796 

M21305 

M22092 

M22898 

M22995 

M23379 

M24400 

M25753 

M27691 

M28213 

M29550 

M29971 

M30269 

M31158 

M31166 

M31210 

M55420 

M59979 

M62810 

M64710 

M68874 

M74524 

M80254 

M81780 

M81780 

M81780 

M81780 

M81780 

M83822 



D84284 Hs.66052 
AW291587 Hs.82733 
D86978 Hs.84790 
D87012 Hs.194685 
AF164142 Hs.82042 
D87432 Hs.10315 
AA013051 Hs.91417 
NM-000437Hs^34392 
X70377 Hs.121489 
AA019521 Hs.301946 
NM.004091HS.231444 
BE613608 Hs.142653 
AI368680 Hs.816 
AL039123 Hs.103042 
L12260 Hs.172816 
AF002225 Hs.180686 
AF128542 Hs.166846 
J00212 

J04029 Hs.99936 
AW067805 Hs.172665 
J04088 Hs.156346 
J04543 Hs.78637 
T29618 Hs.89640 
AA460085 Hs.171075 
L08895 Hs.78995 
L11239 Hs.36993 
BE409525 Hs.902 
Z83689 Hs.114765 
A1984625 Hs.9884 
L14922 Hs.166563 
BE297635 Hs.3069 
NM-005308HS.211569 
H87879 Hs.102267 
AF083892 Hs.75608 
C18356 Hs.295944 
NM002419Hs.89449 
AA101043 Hs.151254 
W76332 Hs.79107 
BE313625 Hs.57435 
AF168418 Hs.116784 
BE535511 
L41607 Hs.934 
AW250122 Hs.154879 
AW675039 Hs.1227 
AW675039 Hs.1227 
AW005903 Hs.78601 
AA557660 Hs.76152 
BE267931 Hs.78996 
M21305 
M22092 

NM_000546Hs.1846 
NM.002884Hs.865 
NIVL002890HS.758 
AW583062 Hs.74502 
BE280074 Hs.23960 
R84694 Hs.79194 
AA535244 Hs.78305 
M29551 Hs.151531 
M29971 Hs.1384 
M30269 

M31158 Hs.77439 
M31166 Hs.2050 
BE246154 Hs.154210 
S55271 Hs.247930 
AW382987 Hs.88474 
AA393273 Hs.75133 
D90337 Hs.247916 
M68874 Hs.211587 
Hs.80612 
Hs.173125 
Hs.77813 



D28459 
MB0254 
X59960 



X59960 Hs.77813 

X59960 Hs.77813 

X59960 Hs.77813 

X59960 Hs.77813 

M83822 Hs.62354 



CD38 antigen (p45) 

nidogen 2 

KIAA0225 protein 

topoisomerase (DNA) til beta 

solute carrier family 23 (nucieobase tra 

solute carrier family 7 (cationic amino 

topoisomerase (DNA) il binding protein 

platelet-activating factor acetylhydrob 

cystafinD 

lysosomal 

Homo sapiens, Similar to hypothetical pr 
ret finger protein 

SRY (sex determining region Y)-box 2 
microtubule-assoclated protein 1B 
neureguiin 1 

ubtquitin protein Ggase E3A (human papi 
polymerase (DNA directed), epsilon 
Empirically selected from AFFX single pr 
keratin 10 (epidermoiytic hyperkeratosis 
methyienetetrahydrofblate dehydrogenase 
topoisomerase (DNA) il alpha (170kD) 
annexlnA7 

TEK tyrosine kinase, endothelial (venous 
replication factor C (activator 1) 5 (36 
MADS box transcription enhancer factor 2 
gastrulation brain homeo box 1 
neurofibromin 2 (bilateral acoustic neur 
myeloid/iymphoid or mixed-lineage leukem 
spindle pole body protein 
replication factor C (activator 1) 1 (14 
heat shock 70kD protein 9B (mortafin-2) 
G protein-coupled receptor kinase 5 
iysyl oxidase 

tight junction protein 2 (zona occludens 
tissue factor pathway inhibitor 2 
mitogen-adivated protein kinase kinase 
kalDkrein 7 (chymotryptic, stratum com 
mitogen-activated protein kinase 14 
solute carrier family 11 (proton-coupled 
thyroid hormone receptor interactor 4 
transmembrane trafficking protein 
glucosamine (N-acetyl) transferase 2, 1 
DiGeorge syndrome critical region gene D 
aminolevulinate, delta-, dehydratase 
aminolevulinate, delta-, dehydratase 
uroporphyrinogen decarboxylase 
decorin 

proliferating cell nuclear antigen 
gb:Human alpha satellite and satellite 3 
gb:Human neural cell adhesion molecule ( ' 
tumor protein p53 (Li-Fraumeni syndrome) 
RAP1A, member of RAS oncogene family 
RAS p21 protein activator (GTPase activa 
chymotrypsinogen B1 
cyciin B1 

cAMP responsive element binding protein 
RAB2, member RAS oncogene femSy 
protein phosphatase 3 (formerly 2B), cat 
06-methylguanine-DNA methyltransferase 
nidogen (enactin) 

protein kinase, cAMP-dependent, regulato 
pentaxin-reiated gene, rapidly induced'b 
endothelial differentiation, sphingoiipi 
Epsilon , IgE 

prostaglandin-endoperoxide synthase 1 (p 
. transcription factor 6-0ke 1 (mrtochond 
natriuretic peptide precursor C 
phospholipase A2, group IVA (cytosoUc 
ubiquitin-conjugating enzyme E2A (RAD6 h 
peptidylprolyt isomerase F (cyciophilin 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
cell division cycle 4-Cke 
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101812 
101813 
133396 
428161 
•129026 
101901 
134831 
134039 
442355 
101975 
101977 
101978 
101998 
102003 
102007 
102009 
416658 
132951 
135389 
102048 
130145 
303153 
420269 
102095 
102123 
102126 
102133 
102139 
102162 
102164 
427653 
131817 
102200 
102210 
102214 
132811 
131319 
102256 
132316 
102269 
417526 
102293 
102298 
102325 
428734 
102361 
102367 
102388 
102394 
129829 
102409 
133746 
102423 
132828 
132828 
132828 
132828 
425322 
102450 
129350 
102534 
130457 
135065 
102560 
102567 
417173 
102638 
132736 
133070 
102663 
134660 
102735 
102741 
130564 
130564 
132164 



M86934 
M87338 
M96326 
M96954 
M98833 
S66793 
S72370 
S78569 
S79873 
S83325 
S83364 
SB3365 
U01212 
U01922 
U02556 
U02680 
U03272 
U04209 
U05237 
U07225 
U07620 
U09759 
U09820 
U11313 
U14518 
U14575 
U15173 
U15932 
U18291 
U18300 
U18383 
U20536 
U21551 
U23028 
U23752 
U25435 
U25997 
U28251 
U28831 
U30245 
U32315 
U32439 
U32849 
U35139 
U36764 
U39400 
U39657 
U41344 
U41766 
U41813 
U43286 
U44378 
U44754 
U47011 
U47011 
U47011 
U47011 
U47077 
U48251 
U50535 
U56833 
U58091 
U58837 
U59289 
U59863 
U67122 
U67319 
U68019 
U69611 
U70322 
U73524 
U79267 
U79291 
U82671 
U82671 
U84573 



BE439894 Hs.78991 
NMJJ02914HS.139226 
M96326 Hs.72885 
M96954 Hs.182741 
AL120297 Hs.108043 
H38026 Hs.308 
AA853479 Hs.89890 
NNL002290HS.78672 
AA456539 Hs.8262 
AA079717 Hs.283664 
AF112213 Hs.184062 
BE561610 Hs.5809 
U01212 Hs.248153 
U01922 Hs.125565 
U02556 Hs.75307 
BE245149 Hs.82643 
U03272 Hs.79432 
AW821182 Hs.61418 
U05237 Hs.99872 



U07225 
U 34820 
U09759 
U72937 
U11313 



Hs.339 

Hs.151051 

Hs.246857 

Hs.96264 

Hs.75760 



NM_001809Hs.1594 
AW950870 Hs.78961 
AU076845 Hs.155596 
NM_004419Hs.2128 
M450274 Hs.1592 
NM.000107HS.77602 
M159001 Hs.180069 
U20536 Hs.3280 . 
AA232362 Hs.157205 
BE619413 Hs.2437 
U23752 Hs.32964 
U25435 Hs.57419 
NrVL003155Hs.25590 
U28251 Hs.53237 
U28831 Hs.44566 
U30245 

AA568906 Hs.82240 
AF090116 Hs.79348 
AA382169 Hs.54483 
AI815B67 Hs.50130 
BE303044 Hs.192023 
AA223616 Hs.75859 
U39656 Hs.1 18825 
AA362907 Hs.76494 
NM_003816Hs.2442 
AF010258 Hs.127428 
BE300330 Hs.118725 
AW410035 Hs.75862 
Z47542 Hs.179312 
AB014615 Hs.57710 
AB014615 Hs.57710 
AB014615 Hs.57710 
AB014615 Hs.57710 



U63630 
U48251 
U50535 
U96759 



Hs.155637 
Hs.75871 
Hs.110630 
Hs.198307 



AB014595 Hs.155976 
AA019401 Hs.93909 



R97457 
U63830 
U61397 
U67319 



Hs.63984 
Hs.146847 
Hs.81424 
Hs.9216 



AW081883 Hs.211578 
U92649 Hs.64311 
NMJ)O2270Hs.168O75 
U73524 Hs.87465 
AF111106 Hs.3382 
AW959829 Hs.83572 
U82671 Hs.36980 
U82671 Hs.36980 
A1752235 Hs.41270 



DNA segment numerous copies, expressed 
replication factor C (activator 1) 2 (40 
azurocidin 1 (catfonte antimicrobial pro 
TIA1 cytotoxic granule-associated RNA-bi 
Friend leukemia virus integration 1 
arrestin 3, retinal (X-arrestin) 
pyruvate carboxylase 
laminin, alpha 4 

lysosomal-associated membrane protein 2 

aspartate beta-hydroxylase 

putative Rab5-interacting protein 

putative transmembrane protein; homolog 

olfactory marker protein 

translocase of inner mitochondrial membr 

t-complex-associated-testis-expressed 1- 

protein tyrosine kinase 9 

fibrillin 2 (congenita) contracture! ara 

microfibriliar-associated protein 1 

fetal Alzheimer antigen 

purinerglc receptor P2Y, G-protein coupl 

mitogen-activated protein kinase 10 

mitogen-actfVated protein kinase 9 

alpha thalassemia/menta) retardation syn 

sterol carrier protein 2 

centromere protein A (17kD) 

protein phosphatase 1, regulatory (inhEb 

BCL2/adenovirus E1B 19kD-interacting pro 

dual specificity phosphatase 5 

CDC16 (cell division cycle 16, S. cerevi 

damage-specific DNA binding protein 2 (4 

nuclear respiratory factor 1 

caspase 6, apoptosis-reiated cysteine or 

branched chain aminotransferase 1, cytos 

eukaryotic translation initiation factor 

SRY (sex determining region YJ-box 11 

CCCTC-binding factor (zinc finger profel 

stanniocalcin 1 

ESTs, Highly similar to Z169JHUMAN ZINC 
KIAA1641 protein 

gb:Human myelomonocytlc specific protein 
syntaxin 3A 

regulator of G-protein signalling 7 
N-myc (and STAT) interactor 
necdin (mouse) homotog 
eukaryotic translation initiation factor 
chromosome 11 open reading frame 4 
mitogen-activated protein kinase kinase 
proline arginlne-rich end leucine-rich r 



homeo box A9 
seienophosphate synthetase 2 
MAD (mothers against decapentaplegic, Dr 
small nuclear RNA activating complex, po 
fibroblast growth factor 8 (androgen-ind 
fibroblast growth factor 8 (androgen-ind 
fibroblast growth factor 8 (androgen-ind 
fibroblast growth factor 8 (androgen-ind 
protein kinase, DNA-activated, catalytic 
protein kinase C binding protein 1 
Human BRCA2 region, mRNA sequence CG006 
von HippeMJndau binding protein 1 
cullin 4B 

cycEc nucleotide gated channel beta 1 

cadherin 13, H-cadherin (heart) 

TRAF family member-associated NFKB activ 

ubtquitin-like 1 (sentrin) 

caspase 7, apoptosis-reiated cysteine pr 

Homo sapiens cDNA: FU23037 fis, done L 

a disintegrin and metalloproteinase doma 

karyopherin (importin) beta 2 

ATP/GTP-binding protein 

protein phosphatase 4, regulatory subuni 

hypothetical protein MGC14433 

melanoma antigen, family A, 2 

melanoma antigen, family A, 2 

procoiiagen-rysine, 2-oxogfuiarate 5-<fio 
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102823 
102826 
102831 
102846 
129777 
134161 
134854 
429257 
413985 
419768 
102915 
134656 
413858 
102968 
102971 
134037 
134037 
103023 
103037 
130282 
134542 
128568 
128568 
103093 
413076 
129063 
424460 
411077 
103181 
103184 
103194 
103208 
129698 
131486 
130729 
103334 
132645 
135094 
103352 
103352 
103353 
132173 
103371 
131584 
103376 
103378 
128510 
103410 
133490 
50 332689 
103438 
103440 
103452 
133536 
420234 
426502 
134662 
132083 
103500 
134389 
132084 
103540 
133152 
103548 
103612 
129092 
103692 
103695 
129796 
434993 
132683 
131887 
103723 
453368 
133260 
103765 



10 



15 



20 



25 



30 



35 



40 



45 



55 



60 



65 



70 



75 



U90914 
U91316 
U91932 
U95131 
U97018 
U97188 
V00503 
X04327 



X07496 
X07820 
X14787 
X15525 
X16396 
X16609 
X53586 
X53586 
X53793 
X54936 
X55740 
X57025 
X60673 
X60673 
X60708 
X62048 
X63097 
X63563 
X64037 



X69878 
X70649 
X72841 
X74987 
X83107 
X84194 
X85753 
X87870 
X89066 
X89398 
X89398 
X89399 
X89426 
X91247 
X91648 



X92110 

X94703 

X96506 

X97230 

X97230 

X98263 

X98296 

X99584 

Y00264 

Y07566 

Y07759 

Y07827 

Y07867 

Y09443 

Y09858 

Y12394 

Z11559 

Z11695 

Z15005 

Z46261 

AA011243 

AA018418 

AA018758 

AA018804 

AA031993 

AA044217 

AA046548 

AA057447 

AA058376 

AA083572 

AA085696 



D85390 Hs.5057 
NM.007274Hs.8679 
AA262170 Hs.80917 
BE264974 Hs.6566 
U97018 Hs.12451 
AA634543 Hs.79440 
J03464 Hs.179573 
AW163799 Hs.198365 
AI018666 Hs.75667 
T72104 Hs.93194 
X07820 Hs.2258 
AI750878 Hs.87409 
NM_001610Hs.75589 
AU076511 Hs.154672 
X16609 Hs.183805 
AI808780 Hs.227730 
AI808780 Hs.227730 
AW500470 Hs.1 17950 
BE018302 Hs.2894 
BE245380 Hs.1 53952 
M14156 Hs.85112 



H12912 
H12912 
S79876 
U10564 
X63094 



Hs.274691 
Hs.274691 
Hs.44926 
Hs.75188 
Hs.283822 



BE275979 Hs.298014 
AW977263 Hs.68257 
X69636 Hs.334731 
U43143 Hs.74049 
NM_004939Hs78580 
AW411340 Hs.31314 
BE242144 Hs.12013 
F06972 Hs.27372 
AI963747 Hs.18573 
NM_001260Hs.25283 
AI654712 Hs.54424 
NM_003304Hs.250687 
H09366 Hs.78853 



H09366 
X89399 
X89426 
X91247 



Hs.78853 
Hs.1 19274 
Hs.41716 
Hs.13046 



AA598509 Hs.29117 
AL036166 Hs.323378 
AL1 19690 Hs.153618 
X94703 

AA158294 Hs.295362 
AF022044 Hs.274601 
AF022044 Hs.274601 
AW1 75781 Hs.152720 
X98296 Hs.77578 
NfvL006936Hs.85119 
W25797.comp 
AW404908 Hs.96038 
Y07759 Hs.170157 
NNL007048HS.284283 
BE386490 Hs.279663 
AW408009 Hs.22580 
Y09858 Hs.82577 
NMJX)2267Hs.3886 
NM.002197HS.154721 
Z11695 Hs.324473 
Z15005 Hs.75573 
BE336654 Hs.70937 
D56365 Hs.63525 
AW137912 Hs.227583 
' AW207152 Hs.186600 
BE218319 Hs.5807 
AA306325 Hs.4311 
BE264633 Hs.143638 
W17064 Hs.332848 
BE274312 Hs.214783 
W20296 Hs.288178 
AA403045 Hs.6906 
AA085696 Hs.169600 



carboxypeptldase D 

cytosoGc acyl coenzyme A thioester hydr 
adaptor-related protein complex 3, slgma 
thyroid hormone receptor Interactor 13 
echlnoderm mlcrotubule-associated protei 
IGF-fl mRNA-binding protein 3 
collagen, type I, alpha 2 
2,3-bisphosphoglycerate mutase 
synaptophysln 
apolipoprotein A-l 

matrix metaltoprotelnase 10 (stromelysln 

thrombospondln 1 

acid phosphatase 2, lysosomal 

methylene tetrahydrofolatB dehydrogenase 

ankyrin 1, erythrocytic 

integrin, alpha 6 

integrin, alpha 6 

multifunctional polypeptide similar to S 

placental growth factor, vascular endoth 

5' nucleotidase (CD73) 

insuEn-Bce growth factor 1 (somatomedi 

adenylate kinase 3 

adenylate kinase 3 

dipeptidylpeptidase IV (CD26, adenosine 

weel (S. pombe) homolog 

Rhesus blood group, D antigen 

polymerase (RNA) II (DNA directed) polyp 

general transcription factor IIF, polype 

Homo sapiens, clone 1MA6E:3448306, mRNA, 

frns-related tyrosine kinase 4 

DEAD/H (Asp-Glu-AIa-Asp/Hls) box polypep 

retinoblastoma-binding protein 7 

ATP-binding cassette, sub-family E (OABP 

BMX non-receptor tyrosine kinase 

acylphosphatase 1 , erythrocyte (common) 

cydln-dependent kinase 8 

hepatocyte nuclear factor 4, atpha 

transient receptor potential channel 1 

uracil-DNA glycosyJase 

uracil-DNA gtycosyiase 

RAS p21 protein activator (GTPase activa 

endothelial cell-specific molecule 1 

thioredoxin reductase 1 

purine-rich element binding protein A 

coated vesicle membrane protein 

HCGVHM protein 

RAB28, member RAS oncogene family 

DR1 -associated protein 1 (negative cofac 

killer cell immunoglobulin-like receptor 

killer ceil immunoglobulln-like receptor 

M-phase phosphoprotein 6 

ubiquitin specific protease 9, X chromos 

SMT3 (suppressor of mif two 3, yeast) ho 

Hs.1 77486 amyloid beta (A4) precursor protein (pro 

Ric (DrosophilaHike, expressed in many 

myosin VA (heavy polypeptide 12, myoxln) 

butyrophilin, subfamily 3, member A1 

Pirin 



karyopherin alpha 3 (importin alpha 4) 

aconitasel, soluble 

mitogen-activated protein kinase 1 

centromere protein E (312kD) 

H3 histone family, member A 

poly(rC)-binding protein 2 

Homo sapiens chromosome X map Xp1 1 .23 L- 

ESTs 

GTPase Rab14 

SUMO-1 activating enzyme subunlt 2 
WD repeat domain 4 
SW1/SNF related, matrix associated, act! 
Homo sapiens cDNA RJ14041 fis, done HE 
Homo sapiens cDNA FU1 1968 fis, clone HE 
Homo sapiens cDNA: FLJ23197 fis, clone R 
KIAA0826 protein 
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103766 AA088744 

103767 AA089688 
132051 AA091284 
103773 AA092700 
135289 AA092968 
409659 AA094800 
103794 AA100219 
131471 AA114885 
134319 AA129547 
103807 AA133016 
446392 AA149507 
129863 AA151005 
103850 AA187101 
103855 AA195179 
103861 AA206236 
130634 AA227621 
447735 AA248283 
103909 AA249611 . 
458928 AA282640 
415824 AA287199 
129013 AA313990 
129435 AA314256 
103988 AA314389 
104000 AA324364 
425284 AA329211 
128629 AA399187 
133281 AA421079 
104104 AA422029 
332455 AA425230 
132091 AA447052 
135073 AA452000 
131367 M456687 
129593 AA487015 
133505 C01527 
132064 C01714 
442351 C01811 
131427 C02352 
433892 C02375 
104282 C14448 
134827 D16611 
425330 D25216 
131742 D31352 
456935 D58024 
425218 D80897 
104334 D82614 
134593 D87845 
134731 D89377 
445776 H06583 
131670 H40732 
104394 H46617 
104402 H56731 
439130 H75570 
129077 H78886 
104417 H81241 
134927 L36531 
129260 M53154 
134498 M63180 
104460 M91504 
104488 N56191 
131248 N78483 
130017 R14652 
104530 R20459 
104534 R22303 
104544 R33779 
'133328 R36553 
104567 R64534 
129575 R70621 
130776 R79356 
104599 R84933 
104660 AA007160 
104667 AA007234 
104718 AA018409 
104764 AA025351 

104786 AA027168 

104787 AA027317 
134079 AA029423 



AI920783 Hs.191435 
BE244667 

AA393968 Hs.180145 
AI219323 Hs.101077 
AW372569 Hs.9788 
AW970843 Hs.55682 
AF244135 Hs.30870 
AA164842 Hs.192619 
BE304999 Hs.285754 
AW958264 Hs.103832 
AF142419 Hs.15020 
BE379765 Hs.129872 
AA187101 Hs.213194 
W02363 

AA206236 Hs.4944 
AI769067 Hs.127824 
AA775268 Hs.6127 
AA24S611 Hs.47438 
AF043117 Hs.24594 
D42039 Hs.78871 
AA371156 Hs.107942 
AF151852 Hs.1 11449 
AA314389 Hs.342849 
AI146527 Hs.80475 
AF155568 Hs.348043 
AL096748 Hs.102708 
AK001601 Hs.69594 
AA422029 Hs.143640 
NM_005754Hs.220689 
AW954243 
W55956 Hs.94030 
AI750575 Hs.173933 
AI338247 Hs.98314 
AI630124 Hs.324504 
AA121098 Hs.3838 
W52642 Hs.8261 
AF151879 Hs.26706 
AI929357 Hs.323966 
C14448 Hs.332338 
BE314037 Hs.89866 
D25216 Hs.155650 
AA961420 Hs.31433 
AA370362 Hs.57958 
NM_014909Hs.155182 
D82614 Hs.78771 
NM_000437Hs.234392 
D89377 Hs.89404 
NM 001310HS.13313 
H03514 Hs.15589 
AA129551 Hs.172129 
H56731 Hs.132956 
AA306090 Hs.124707 
N74724 Hs.108479 
AI819448 Hs.320861 
L36531 Hs.91296 
M63154 Hs.110014 
AW246273 Hs.84131 
AW955705 Hs.62604 
N56191 Hs.1 06511 
AI0389B9 Hs.332633 
AK000096 HS.143198 
AK001676 Hs.12457 
R22303 

AI091173 Hs.222362 
AW452738 Hs.265327 
AA040620 Hs.5672 
F08282 Hs.278428 
AF167706 Hs.19280 
AW815036 Hs.151251 
BE298665 Hs.14846 
AI239923 Hs.63931 
AI143020 Hs.36250 
AI039243 Hs.278585 
AA027167 Hs. 10031 
AA027317 

AK001751 Hs.171835 



ESTs 

CGMOO protein 
HSPC030 protein 

ESTs, WeaWy simter to T22363 hypotheti 

hypothetical protein MGC10924 similar to 

eukaryotic translation initiation {actor 

hepatocellular carcinoma-associated anti 

KIAA1600 protein 

fumarate hydratase 

similar to yeast Up/3, variant B 

homolog of mouse quaking QKI (KH domain 

sperm associated antigen 9 

hypothetical protein M6C10895 

hypothetical protein FU10330 

hypothetical protein FU12783 

ESTs, Weakly simDar to T28770 hypotheti 

Homo sapiens cDNA: FU23020 fis, clone L 

SH3 domain binding glutamic acid-rich pr 

ubiquitination factor E4B (homologous to 

mesoderm development candidate 2 

DKFZP564M112 protein 

CGI-94 protein 

ADP-ribosylation factor-jike 5 .. 
polymerase (RNA) II (DNA directed) polyp 
NS1-associated protein 1 
DKFZP434A043 protein 
high-mobility group 2QA 
ESTs, Weakly similar to hyperpolarizatio 
Ras-GTPase-activating protein SH3-domain 
KIAA0251 protein 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 
nuclear factor l/A 

Homo sapiens mRNA; cDNA DKFZp586L0120 (f 
Homo sapiens mRNA; cDNA DKFZp586J0720 (f 
serum-induclble kinase 
hypothetical protein FU22393 
CGI-121 protein 

Homo sapiens clone H63 unknown mRNA 
EST 

coproporphyrinogen oxidase (coproporphyr 

KIAA0014 gene product 

ESTs 

EGF-TM7-latrophifin-felated protein 
KIAA1036 protein 
phosphoglycerate kinase 1 
platelet-activating factor acetylhydrola 
msh (Drosophiia) homeo box homoiog 2 
cAMP responsive element binding protein- 
ESTs 

Homo sapiens cDNA: FU21409 fis, clone C 

ESTs 

ESTs 

ESTs 

KruppeHike factor 8 
integrin, alpha 8 

gastric intrinsic factor (vitamin B synt 

threonyt-tRNA synthetase 

Homo sapiens, clone IMAGE:4299322, mRNA, 

protocadherfn 17 . 

Bardet-Biedl syndrome 2 

inhibitor of growth family, member 3 

hypothetical protein FU10814 

gb:yh26b09.rl Soares placenta Nb2HP Homo 

ESTs, Weakly similar to p40 [H.sapiens] 

hypothetical protein DKFZp761l141 

hypothetical protein AF140225 

progestin induced protein 

cysteine-rich motor neuron 1 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564D016 (fr 
ESTs 

ESTs, WeaWy similar to I38022 hypotheti 
ESTs 

KIAA0955 protein 

gb:ze97d11.s1 Soares_fetaLhearLNbHH19W 
hypothetical protein FU10889 
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104804 AA031357 A1858702 Hs.31803 ESTs, Weakly similar to N-WASP (Rsaplen 

104855 AA045136 T79340 Hs.22575 B-cell CLL/lymphoma 6, member B (zinc fi 

130828 AA053400 AW631469 Hs.203213 ESTs 

104907 AA055829 AA055829 Hs.196701 ESTs, Weakly similar to ALU1JWMAN ALUS 

5 104943 AA065217 AF072873 Hs.114218 frizzled (DrosophDa) homolog 6 

105013 AA1 16054 H63789 Hs.296288 ESTs, Weakly similar to KIAA0638 protein 

105024 AA126311 AA126311 Hs.9879 . ESTs 

132592 AA129390 AW803564 Hs.288850 Homo sapiens cDNA: FU22528 fe, clone H 

105038 AA130273 AW503733 Hs.9414 KIAA1488 protein 

10 105077 AA142919 W55946 Hs.234863 Homo sapiens cDNA FU 12082 fis, clone HE 

105098 AA150205 AL042506 Hs.21599 KruppeWike factor 7 (ubiquitous) 

129215 AA176867 AB040930 Hs.126085 KIAA1497 protein 

105169 AA180321 BE245294 Hs.180789 S164 protein 

132796 AA180487 NM_006283Hs.173159 transforming, acidic coiled-coil contain 

15 427210 AA187634 BE396283 Hs.173987 eukaryotic translation initiation factor 

105200 AA195399 AA328102 Hs.24641 cytoskeleton associated protein 2 

130114 AA234717 AA233393 Hs.14992 hypothetical protein FU1 1151 

105330 AA234743 AW338625 Hs.22120 ESTs 

105337 AA234957 AI468789 Hs.347187 myotubularin related protein 1 

20 422040 AA235604 . AA172106 Hs.1 10950 Rag C protein 

105376 AA236559 AW994032 Hs.8768 hypothetical protein FU10849 

105397 AA242868 AA814807 HsJ395 hypothetical protein FU23182 

431679 AA251776 AK000046 Hs.343877 hypothetical protein FU20039 

131991 AA251909 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 

25 421305 AA252672 BE397354 Hs.324830 diptheria toxin resistance protein requi 

105489 AA256157 AA256157 Hs.24115 Homo sapiens cDNA FU14178 fis, clone NT 

105508 AA256680 AA173942 Hs.326416 Homo sapiens mRNA; cDNA DKFZp564H1916 {f 

105539 AA258873 AB040884 Hs.109694 KIAA1451 protein 

135172 AA262727 AB028956 Hs.12144 KIAA1033 protein 

30 131569 AA281451 AL389951 Hs.271623 nudeoporin 50kD 

431129 AA281545 AL137751 Hs.263671 Homo sapiens mRNA; cDNA DKFZp434l0812 (f 

105643 AA282069 BE621719 Hs.173802 K1AA0603 gene product 

105659 AA283044 AA283044 Hs.25625 hypothetical protein FU1 1323 

105666 AA283930 AA426234 Hs.34906 ESTs, Weakly similar to T17210 hypotheti 

35 105674 AA284755 AI609530 Hs.279789 histone deacetylase 3 

105709 AA291268 AI928962 Hs.26761 DKFZP586L0724 protein 

105722 AA291927 AI922821 Hs.32433 ESTs 

105765 AA343514 AA299688 Hs.24183 ESTs 

115951 AA398109 BE546245 Hs.301048 sec13-like protein 

40 130884 AA398109 BE546245 Hs.301048 sec134ike protein 

105962 AA405737 AW880358 Hs.339808 hypothetical protein FU10120 

105985 AA406610 AA406610 gb:zv15b10.s1 SoaresJJhHMPu_S1 Homosapi 

106008 AA411465 AB033888 Hs.8619 SRY (sex determining region Y)-box 18 - 

457322 AA416886 AI815486. Hs.243901 Homo sapiens cDNA FU20738 fis, clone HE 

45 134222 AA424013 AW855861 Hs.8025 Homo sapiens clone 23767 and 23782 mRNA 

446954 AA424148 AB037850 Hs.16621 DKFZP4341 116 protein 

106141 AA424558 AF031463 Hs.9302 phosducin-like 

447973 AA424961 AB011169 Hs.20141 similar to S. cercvisiae SSM4 

• 106157 AA425367 W37943 Hs.34B92 WAA1 323 protein 

50 428314 AA425921 AW135049 Hs.26285 Homo sapiens cDNA FU10643 fis, clone NT 

• 446727 AA426220 AB011095 Hs.16032 K1AA0523 protein 

106196 AA427735 AA525993 Hs. 173699 ESTs, WeaWy similar to ALU1_HUMAN ALU S 

457714 AA430673 AA083764 hypothetical protein MGC3178 

133200 AA432248 AB037715 Hs.183639 hypothetical protein FU10210 

55 105302 AA435896 AA398859 Hs.18397 hypothetical protein FU23221 

' • 106328 AA436705 AL079559 Hs.28020 KIAA0766 gene product 

450534 AA446561 AI570189 Hs.25132 KIAA0470 gene product 

106423 AA448238 AB020722 Hs.16714 Rho guanine exchange factor (GEF) 15 

439608 AA449756 AW864696 Hs.301732 hypothetical protein MGC5306 

60 106477 AA450303 R23324 Hs.41693 DnaJ (Hsp40) homolog, subfamily B, membe 

106503 AA452411 AB033042 Hs.29679 cofactor required for Sp1 transcripfiona 

446999 AA454566 AA151520 hypothetical protein MGC4485 

106543 AA454667 AA676939 Hs.69285 neuropllin 1 

442007 AA456437 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 

65 106589 AA456646 AK000933 Hs.28661 Homo sapiens cDNAFUl 0071 fis, clone HE 

106593 AA456826 AW296451 Hs.24605 ESTs 

106596 AA456981 AA452379 ESTs, Moderately similar to ALU7_HUMAN A 

423064 AA458959 AF265208 Hs.8740 SWI/SNF related, matrix associated, acti 

105636 AA459950 AW958037 Hs.286 ribosomal protein L4 

70 105654 AA460449 AW075485 Hs.286049 phosphoserine aminotransferase 

131353 AA463910 AW754182 gb:RC2-CT0321-131 19901 1-c01 CT0321 Homo 

106707 AA464603 AK000566 Hs.98135 hypothetical protein FU20559 

452909 AA464606 NMJ)15368Hs.3G985 pannexlnl 

106717 AA465093 AA600357 Hs.239489 TIA1 cytotoxic granule-associated RNA-bl 

75 453141 AA465692 AB014548 Hs^1921 K1AA0648 protein 

106747 AA476473 NM_007118Hs.171957 triple functional domain (PTPRF interact 
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105773 AA478109 
106781 AA478474 
108817 AA480869 
106846 AA485223 
106848 AA485254 
106856 AA486183 
418699 AA496936 
107001 AA598589 
442853 AA598831 
107054 AA600150 
107059 AA608545 
107080 AA609210 
107115 AA610108 
107130 AA620582 
107156 AA621239 
107174 AA621714 
130621 AA621718 
107190 D19673 
132626 D25755 
107217 D51095 
332584 D60272 
444655 T08879 
107295 T34527 
107299 T40327 

107315 T62771 

107316 T63174 
107328 T83444 
107334 T93641 
456340 U48263 
128636 U49065 
129938 U79300 
107375 U88573 
130074 U93867 
107387 W01094 
132036 W01568 
107426 W26853 
135388 W27965 
130419 W36280 
107469 W47063 
434203 W79060 
107506 W88550 
132358 X60486 
107522 X78931 
456495 Z14077 
107582 AA002147 
107609 AA004711 
107661 AA010383 
107714 AA015761 
107775 AA018772 
107832 AA021473 
107859 AA024835 
107914 AA027229 
107935 AA029428 
410196 AA035143 
131461 AA035237 
108007 AA039347 
108029 AAO40740 
108040 AA041551 
108084 AA045513 
108088 AA045745 
108168 AA055348 
130719 AA056582 

108189 AA056697 

108190 AA056746 
108203 AA057678 

108216 AA058681 

108217 AA058686 
108245 AA062840 
108277 AA064859 
108280 AA065069 
108309 AA069923 
108340 AA070815 
108403 AA075374 
108427 AA076382 
108435 AA078787 
108439 AA078986 



AA478109 Hs.188833 
AA330310 Hs.24181 
D61216 Hs.18672 
AB037744 Hs.34892 
AA449014 Hs.121025 
W58353 Hs.285123 
BE539639 Hs.173030 
AI926520 Hs.31016 
AW021276 Hs.17121 
A1076459 Hs.15978 
BE614410 Hs.23044 
AL122043 Hs.19221 
BE379623 Hs.27693 
AB033106 Hs.12913 
AA137043 Hs.9663 
BE122762 Hs.25338 
AW513087 Hs.16803 
AA836401 Hs.87860 
AW504732 Hs.21275 
AL080235 Hs.35861 
AA357879 Hs.29423 
AF088886 Hs.11590 
AA186629 Hs.80120 
BE277457 Hs.30661 
AA316241 Hs.90691 
T63174 Hs.193700 
AW959891 Hs.76591 
T93597 Hs.187429 
U48263 Hs.89040 
U49065 Hs.102865 
AW003668 Hs.135587 
BE011845 Hs.251064 
AL038596 Hs.250745 
D86983 Hs.1 18893 
AL157433 Hs.37706 
W26853 Hs.291003 
W27965 Hs.99865 
AF037448 Hs.1 55489 
W47063 Hs.94668 
BE262677 Hs.283558 
AB028981 Hs.8021 
NMJ03542H3.46423 
X78931 Hs.99971 
NMJ034O3HS.97496 
AA002147 Hs.59952 
R75654 Hs.164797 
AA010383 Hs.60389 
AA015761 Hs.60642 
AW008846 Hs.60857 
AA021473 

AW732573 Hs.47584 
AA027229 Hs.61329 
AA029428 Hs.61555 
AI936442 Hs.59838 
AA992841 Hs.27263 
AA039347 Hs.61916 
AA040740 Hs.62007 
AL121031 Hs.159971 
AA058944 Hs.1 16602 
AA045745 Hs.62886. 
AI453137 Hs.63176 
AA679262 Hs.14235 
AW376061 Hs.63335 
AA056746 Hs.6333B 
AW847814 Hs.289005 
AA524743 Hs.44883 
AA058686 Hs.62588 
BE410285 Hs.89545 
AA064859 
AA065069 
AA069818 

AA069820 Hs.180909 

AA075374 

AA076382 

T82427 Hs.194101 
AA078986 



ESTs 
ESTs 
ESTs 

K1AA1323 protein 

chromosome 11 open reading frame 5 
Homo sapiens mRNA full length insert cDN 
ESTs, Weakly similar to ALU8_HUMAN ALU S 
putative DNA binding protein 
ESTs 

KIAA1272 protein 

RAD51 (S. cerevislae) homolog (E cofi Re 
hypothetical protein DKFZp566G1424 
peptidylprolyl Isomerase (cyclophSin)-) 
KIAA1280 protein 

programmed cell death 6-lnterading prot 
ESTs 

LUC7(S.cerevislae).like 
ESTs. 

hypothetical protein FLJ1101 1 
DKFZP586E1621 protein 
ESTs; Weakly similar to macrophage lecti 
cathepsin F 

UDP-rtocetyl-alpha-l>gaIactosamlne:polyp 

hypothetical protein MGC4606 

nucteophosmin/nucleoplasmin 3 

Homo sapiens mRNA; cDNA DKFZp586l0324 (f 

KIAA0887 protein 

ESTs 

prepronociceptin 
interieukin 1 receptor-like 2 
Human clone 23629 mRNA sequence 
high-mobility group (nonhistone chromoso 
polymerase (RNA) III (DNA directed) (62k 
Melanoma associated gene 
hypothetical protein DKFZp434E2220 
hypothetical protein MGC4707 - 
epimorphln 

NS1 -associated protein 1 
ESTs 

hypothetical protein PR01855 

KIAA1058 protein 

H4 histone family, member G 

zinc finger protein 272 

YY1 transcription factor 

EST 

hypothetical protein FU13693 

ESTs 

ESTs 

ESTs 

gb:ze66c11.s1 Soares retina N2MHR Homo 
potassium voltage-gated channel, delayed 
ESTs, Weakly similar to T1 6370 hypotheti 
ESTs 

hypothetical protein FU108G8 

KIAA1458 protein 

EST 

ESTs . : 

SWI/SNF related, matrix associated, act! 
Homo sapiens, clone IMAGE:41 54008, mRNA, 
ESTs 
ESTs 

hypothetical protein FU20008; KIAA1839 
ESTs, Moderately similar to A46010 X-Hn 
EST 

Homo sapiens cDNA: FU21532 fis, clone C 

ESTs 

ESTs 

proteasome (prosome, macropain) subunit 
gb:zm50fQ3.s1 Stratagene fibroblast (937 
gb:zm12e11.s1 Stratagene pancreas (93720 
gb2m67e03.ii Stratagene neuroepitheiium 
peroxiredoxin 1 

gb:zm87a01.s1 Stratagene ovarian cancer 
gb:zm91g08.s1 Stratagene ovarian cancer 
Homo sapiens cDNA: FU20869 fis. clone A 
gbzm92h01.s1 Stratagene ovarian cancer 
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108465 AA079393 
108469 AA079487 

108500 AA083207 

108501 AA083256 
108533 AA084415 
108562 AA085274 
108589 AA088678 
130890 AA 100 925 
432645 AA101255 
130385 AA126474 
108749 AA127017 

108807 AA129968 

108808 AA130240 
108833 AA131866 
108846 AA132983 
108857 AA1 33250 
131474 AA133583 
108894 AA135941 
108941 AA148650 
108968 AA151110 
108996 AA155754 
109001 AA156125 
131183 AA156289 
109019 AA156997 

109022 AA157291 

109023 AA157293 
109088 AA164293 
109072 AA164676 
426981 AA167375 
130346 AA167550 
109146 AA176589 
109172 AA180448 
428438 AA187144 
129208 AA189170 
109222 AA192757 
109300 AA205650 
109481 AA233342 
109485 AA233472 
109516 AA234110 
109537 D80981 
109556 F01660 

109577 F02206 

109578 F02208 
109595 F02544 
109625 F03918 
428376 F04258 
109648 F04600 
109671 F08998 
109699 F09605 
109820 F11115 
109933 H06371 
110014 H10995 
110039 H11938 
110099 H16568 
110107 H16772 
110155 H18951 
110197 H20859 
110223 H23747 
110306 H38087 
110335 H40331 
110342 H40567 
110395 H46966 
'110511 H56640 
110523 H57154 
110715 H96712 
110754 N20814 
428454 N25249 
431663 N27100 
134263 N39616 
110938 N48982 
110983 N51957 
111081 N59435 
111128 N64139 
431548 N66981 
111216 N68640 
437562 N69352 



AA079393 Hs.3462 
AA079487 

AA083207 Hs.68270 
AA083256 
AA084415 
AA100796 

AI732404 Hs.68846 
AI907537 Hs7B698 
D14041 Hs.347340 
AW067800 Hs.155223 
AA127017 Hs.71052 
AI652236 Hs.49376 
AA045088 Hs.62738 
AF188527 Hs.61661 
AL1 17452 Hs.44155 
AK001468 Hs.62180 
L46353 Hs.2726 
AK001431 Hs.5105 
AA148650 

AI304870 Hs.1 88680 
AW995610 Hs.332436 
AI056548 Hs.72116 
AI611807 Hs.285107 
AA156755 Hs.72150 
AA157291 Hs.21479 
AA157293 Hs.72168 
AA164293 Hs.72545 
AI732585 Hs.22394 
AL044675 Hs.173081 
H05769 Hs.188757 
AA176589 Hs.142078 
AA180448 Hs.144300 
NM_001955Hs.2271 
AI587376 Hs.109441 
AA192833 Hs.333512 
AA418276 Hs.170142 
AA878923 Hs.289069 
BE619092 Hs.28465 
AI471639 Hs.71913 
AI858695 Hs.34898 
AI925294 Hs.87385 
F02206 Hs.295639 
F02208 Hs.27214 
AA078629 Hs.27301 
H29490 Hs.22697 
AF119665 Hs.184011 
H17800 Hs.7154 
R59210 Hs.26634 
H18013 Hs.167483 
AW016809 Hs.119021 
R52417 Hs.20945 
AJ-109666 Hs7242 
H11938 Hs.21907 
R44557 Hs.23748 
AW151660 Hs.31444 
AI559826 Hs.93522 
AW090386 Hs.112278 
H19836 Hs.31697 
H38087 Ks.105509 
H65490 Hs.18845 
H40961 Hs.33008 
AA025116 Hs.33333 
H56640 Hs.221460 
AI040384 Hs.19102 
H96712 Hs.269029 
AW302200 Hs.6336 
U55936 Hs.1 84376 
NM_016569Hs.267182 
AW973443 Hs.8086 
N48982 Hs.38034 
NM.015367HS.10267 
AI146349 Hs.271614 
AW505364 Hs.19074 
AI834273 Hs.9711 
AW139408 Hs.152940 
AB001636 Hs.5683 



cytochrome c oxidase subunli Vile 
gb:zm97fD8.s1 Stratagene colon HT29 (937 
EST 

gb:zn08g12.s1 Stratagene hNT neuron (937 
gb:zn06g09.s1 Stratagene hNT neuron (937 
gb:zm26c06.s1 Stratagene pancreas (93720 
ESTs 

stress-associated endoplasmic reticulum 
H-2K binding fector-2 
stanniocaicin 2 
ESTs 

hypothetical protein FU20644 
ESTs 

ESTs, Weakly similar to AF174605 1 F-box 

DKFZP586G1517 protein 

aniflin (DrosophOa Scraps homotog), act 

high-mobility group (nonhistone chromoso 

hypothetical protein FU10569 

gbzo09e06.$1 Stratagene neuroepithelium 

ESTs 

EST 

hypothetical protein FU20992 similar to 
hypothetical protein FU13397 
ESTs 

ubinucleln 1 

ESTs 

ESTs 

hypothetical protein FU10893 
KIAA0530 protein 

Homo sapiens, done MGC:5564, mRNA, comp 

EST 

EST 

endothelin 1 
MSTP033 protein 
similar to rat myomegalin 
ESTs 

hypothetical protein FU21016 

Homo sapiens cDNA: FU21869 fis, clone H 

ESTs 

ESTs 

ESTs 

Homo sapiens potassium channel subunit ( 

ESTs 

ESTs 

ESTs 

pyrophosphatase (inorganic) 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens clone 24993 mRNA sequence 
Homo sapiens mRNA full length insert cDN 
histone acetyltransferase 
ESTs . 
ESTs 

Homo sapiens mRNA for KIAA1647 protein, 

arrestin, beta 1 

ESTs 

CTL2gene 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to organic anion tr 
ESTs 

K1AA0672 gene product 
synaptosomal-assoriated protein, 23kD 
TBX3-iso protein 

RNA (guanlne-7-) methyltransferase 
Homo sapiens cDNA FU12924 fis, done NT 
MIL1 protein 
CGI-112 protein 

LATS (large tumor suppressor, Drosophiia 

novel protein 

ESTs 

DEAD/H (Asp^lu^ia-Asp/His) box polypep 
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111399 R00138 


AW270776 Hs.18857 


111514 R07998 


R07998 




428744 R08929 


BE267033 


Hs.192853 


111574 R10307 


AI024145 


Hs.188526 


111804 R33354 


AA482478 


Hs.181785 


111831 R36083 


R36095 


Hs.268695 


426773 R37938 


NM_015556Hs.172180 


111904 R39330 


Z41572 




428371 R40816 


AB012193 


Hs.183874 


112033 R43162 


R49031 


Hs.22627 


130987 R45698 


BE613269 


Hs.21893 


112300 R54554 


H24334 


Hs.26125 


112513 R68425 


R68425 


Hs.13809 


112514 R68568 


R68568 


Hs.183373 


112522 R68763 


R68857 


Hs.265499 


112540 R7Q467 


R69751 




428655 R73565 


H05769 


Hs.188757 


129534 R73640 


AK002126 


Hs.11260 


112597 R78376 


R78376 


Hs.29733 


112732 R92453 


R92453 


Hs.34590 


451798 T03865 


BE297567 


Hs.27047 


112888 T03872 


AW195317 Hs.107716 


131863 T10072 


A1656378 


Hs.33461 


112911 T10080- 


AW732747 Hs.13493 


132215 T10132 


AL035703 


Hs.4236 


112931 T15343 


T02966 


Hs.167426 


112984 T23457 


T16971 


Hs.289014 


112998 T23555 


H11257 


Hs.22968 


133376 T23670 


BE618768 


Hs.7232 


113026 T23948 


AA376654 




113070 T33464 


AB032977 


Hs.6298 


410781 T34413 


AI375672 


Hs.165028 


113074 T34611 


AK001335 


Hs.31137 


113095 T40920 


AA828380 Hs.126733 


113179 T55182 


BE622021 


Hs.152571 


113337 T77453 


T77453 


Hs.302234 


113421 T84039 


A1769400 


Hs.189729 


113454 T86458 


AI022166 


Hs.16188 


113481 T87693 


T87693 


Hs.204327 


453345 T89350 


AA302862 


Hs.90063 


113557 T90945 


H66470 


Hs.16004 


113559 T90987 


T79763 


Hs.14514 


113589 T91863 


AI078554 


Hs.15682 


113591 T91881 


T91881 


Hs.200597 


113619 T93783 


R08665 


Hs.17244 


113683 T96687 


AB035335 Hs.144519 


113692 T96944 


AL360143 


Hs.17936 


113702 T97307 


T97307 




113717 T97764 


T99513 


Hs.187447 


113824 W48817 


AI631964 


Hs.34447 


113840 W58343 


R72137 


Hs.7949 


113844 W59949 


AI369275 


Hs.243010 


113902 W74644 


AA340111 


Hs.100009 


113904 W74761 


AF125044 


Hs.19196 


113905 W74802 


R81733 


Hs.33106 


113931 W81205 


BE255499 


Hs.3496 


113932 W81237 


AA256444 


Hs.126485 


131965 W90146 


W79283 


Hs.35962 


114035 W92798 


W92798 


Hs.269181 


114106 Z38412 


AW602528 




457308 Z38709 


AI416988 


Hs.238272 


114161 Z38904 


BE548222 


Hs.299883 


424949 Z39103 


AF052212 


Hs.153934 


457548 Z39930 


AW069534 Hs.279583 


128937 Z39939 


AA251380 Hs.10726 


' 432554 Z40012 


AI479813 


Hs.278411 


114277 Z40377 


AI052229 


H*25373 


114304 Z40820 


AI934204 


Hs.16129 


44AOCA 7>HCQft 

114364 


AL1 17427 


Hs.172778 


432620 AA005112 


AA777749 


Hs.5978 


129034 AA005432 


AA481157 


Hs.108110 


131881 AA010163 


AW361018 Hs.3383 


332421 AA026356 


A1909968 


Hs.108106 


114465 AA026901 


BE621056 


Hs.131731 


451271 AA036867 


AK001644 


Hs.26156 


332498 AA044644 


AA303661 





ESTs 

gb:yf16g11.s1 Scares fetal liver spleen 
ubiquitin-conjugating enzyme E2G 2 {homo 
ESTs 
ESTs 
ESTs 

KIAA0440 protein 

gb:HSCZYB122 normalized Infant brain cDN 
cuHin 4A 
ESTs 

hypothetical protein DKFZp761N0624 
ESTs 

hypothetical protein FU10648 
src homology 3 domain-containing protein 
ESTs 

gb:yi40a10.s1 Soares placenta Nb2HP Homo 
Homo sapiens, done MGC:5564, mRNA, comp 
hypothetical protein RJ1 1 264 
EST 
ESTs 

hypothetical protein FU20392 
hypothetical protein FLJ22344 
ESTs 

like mouse brain protein E46 
KIAA0478 gene product 
ESTs 

ESTs, Weakly similar to A43932 mucin 2 p 
Homo sapiens done IMAGE:451939, mRNA se 
acetyl-Coenzyme A carboxylase alpha 
eukaryotic translation initiation factor 
KIAA1151 protein 
ESTs 

protein tyrosine phosphatase, receptor t 
ESTs 

ESTs, Highly simitar to IGF-ll rnRNA-bind 

ESTs 

ESTs 

ESTs 

EST 

neurocaicin delta 

ESTs 

ESTs 

ESTs 

KIAA0563 gene produd 
hypothetical protein FU13605 
T-cei! leukemia/lyrnphoma 6 
DKFZP434H132 protein 
gb:ye53h05.s1 Soares fetal liver spleen 
ESTs 
ESTs 

DKFZP586B2420 protein 

Homo sapiens cDNA FU14445 fis, clone HE 

acyl-Coenzyme A oxidase 1, palmitoyl 

ubiqultin-conjugating enzyme HBUCE1 

ESTs 

hypothetical protein MGC15749 
hypothetical protein FU12604; K1AA1692 
ESTs 
ESTs 

gb:RC5-BT0562-260100-011-A02 BT0562 Homo 
inositol 1,4,5-triphosphate receptor, ty 
hypothetical protein FU23399 
core-binding factor, runt domain, alpha 
CGI-81 protein 

ESTs, Weakly similar to ALULHUMAN ALU S 

NCK-assodated protein 1 

ESTs, Weakly similar to T20410 hypothetl 

ESTs 

Homo sapiens mRNA; cDNA DKFZp566P013 (fr 
UM domain only 7 
DKFZP547E21 10 protein 
upstream regulatory element binding prot 
transcription factor 
hypothetical protein FU11099 
hypothetical protein FU10782 
rymphocyte-spedfic protein 1 
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431555 AA046426 
132944 AA054515 
114618 AA084162 
332509 AA085749 
114648 AA101056 
114658 AA102746 
132456 AA114250 
450847 AA126561 
132225 AA128980 
437197 AA129757 
114709 AA129921 
456926 AA133331 
114750 AA135958 
426806 AA136524 
114763 AA147044 
114767 AA148885 
114774 AA150043 
129388 AA151621 
457742 AA155743 
456200 AA156335 
130207 AA156336 
114798 AA159181 
114800 AA159825 
114828 AA234185 
114846 AA234929 
114848 'AA234935 
114902 AA236359 
132271 AA236466 
114907 AA236535 
420170 AA236935 
132204 AA236942 
114928 AA237018 
132481 AA237025 
114932 AA242751 
314162 AA242760 
131006 AA242763 
114935 AA242809 
408908 AA243133 
437754 AA243495 
114957 AA243706 
114974 AA250848 
114977 AA250868 
114995 AA251152 
115005 AA251544 
417177 AA251792 
115026 AA252144 
115045 AA252524 
115068 AA253461 
133138 AA255522 
332668 AA255522 
115114 AA256468 
129584 AA256528 
115137 AA257976 
417187 AA258296 

115166 AA258409 

115167 AA258421 
436719 AA262077 
115239 AA278650 
115243 AA278766 
428419 AA280791 
115322 AA280819 
413303 AA280828 
115372 AA282195 
409962 AA2B3127 
130269 AA284694 
456570 AA291137 
332675 AA291708 
407864 AA293495 
115536 AA347193 
408799 AA398474 
115575 AA398512 
115601 AA400277 
434428 AA400896 
115683 AA410345 
115715 AA416733 
132952 AA425154 



AI815470 Hs.260024 
T96641 Hs.6127 
AW979261 Hs.291993 
AA128376 Hs.153884 
AA101056 

AA102383 Hs.249190 

AB011084 Hs.48924 . 

NMJ03155HS.25590 

AA128980 

W38586 

AA397651 Hs.301959 
AB018284 Hs.158688 
AA887211 Hs.129467 
T19228 Hs.172572 
AA810755 Hs.102500 
AI859865 Hs.154443 
AV656017 Hs.184325 
AA662477 Hs.1 10954 
BE561824 Hs.273369 
AA768242 Hs.80618 
AF044209 Hs.144904 
AA159181 Hs.54900 
Z19448 Hs.131887 
AA252937 Hs.283522 
BE018682 Hs.166196 
BE614347 -HS.169B15 
AW275480 Hs.39504 
AB030034 Hs.115175 
N29390 Hs.13804 
U43374 Hs.95631 
AA235827 Hs.42265 
AA237018 Hs.94869 
W93378 Hs.49614 
AA971436 Hs.16218 
BE041820 Hs.38516 
AF064104 Hs.22116 
H23329 Hs.29088O 
BE296227 Hs.250822 
R60366 Hs.5822 
AW170425 Hs.87680 
AW966931 Hs.302649 
AW296978 Hs.87787 
AA769266 Hs.193657 
AI760825 Hs.153042 
NM_004458Hs.81452 
AA251972 Hs.188718 
AW014549 Hs.58373 
AW512260 Hs.87767 
AV657594 Hs.181161 
AV657594 Hs.181161 
AA527548 HsJ527 
AV656017 Hs.184325 
AW968304 Hs.56156 
AB011151 Hs.334659 
AF095727 Hs.287832 
AA749209 Hs.43728 
Y11192 Hs.5299 
BE251328 Hs.73291 
AA806600 Hs.1 16665 
U49436 

L08895 Hs.78995 
AW836130 Hs.75277 
AW014385 Hs.88678 
U82671 Hs.57698 
F05422 Hs.168352 
AA286914 Hs.183299 
BE439944 

AF069291 Hs.40539 
AK001468 Hs.62180 
AA059412 Hs.47986 
AA393254 Hs.43619 
AA148984 Hs.48849 
D14540 Hs.199160 
AF255910 Hs.54650 
BE395161 Hs.1390 
A1658580 Hs.61426 



Cdc42 effector protein 3 

Homo sapiens cDNA: FU23020 fis, done L 

ESTs 

ATP binding protein associated with ceil 
gb:zn25b03.s1 Stratagene neuroepithethjm 
tumor necrosis factor receptor superfaml 
K1AA0512 gene product; ALEX2 
stanniocaldn 1 

gb2o09a11.s1 Stratagene neuroepithelium 
guanine nucleotide binding protein (G pr 
proline synthetase co-transcribed (bade 
KIAA0741 gene product 
ESTs 

hypothetical protein FU20093 
hypothetical protein dJ511E16.2 
minichromosome maintenance deficient (S. 
CGI-76 protein 

hypothetical protein RJ23471 

uncharacterized hematopoietic stem/proge 

hypothetical protein 

nuclear receptor co-repressor 1 

serologically defined colon cancer antig 

ESTs, WeaWy similar to T24396 hypotheti 

Homo sapiens mRNA; cDNA DKFZp434J1912 (f 

ATPase, Class I, type 8B, member 1 

hypothetical protein FU20989 

hypothetical protein MGC4308 

sterile-alpha motif and leucine zipper c 

hypothetical protein dJ462023.2 

Human normal keratinocyte mRNA 

ESTs 

ESTs 

ESTs 

KIAA0903 protein 

Homo sapiens, done MGC:15887, mRNA, com 

CDC14 (cefl division cyde 14, S. cerevl 

ESTs, Weakly simitar to ALU1JHUMAN ALU S 

serine/threonine kinase 15 

Homo sapiens cDNA: FU22120 fis, clone H 

ESTs 

nudeosome assembly protein 1-Bke 1 

ESTs 

ESTs 

ESTs 

fatty-add-Coenzyme A ligase, long-chain 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FU14643 fis, clone NT 
ESTs 

small fragment nudease 
CGI-76 protein 
ESTs 

hypothetical protein MGC141 39 
myelin protein zero-like 1 
hypothetical protein 

aldehyde dehydrogenase 5 family, member 
hypothetical protein FU10881 
KIAA1842 protein 
KIAA1856 protein 

MADS box transcription enhancer fatibr 2 
hypothetical protein FLJ1391 0 
ESTs, Weakly similar to Unknown [Ksapie 
Target CAT 

nudeoporin-like protein 1 

ESTs 

ESTs 

chromosome 8 open reading frame 1 
anDIin (Drosophfla Scraps homotog), act 
hypothetical protein MGC10940 
ESTs 

ESTs, Weakly similar to ALU4_HUMAN ALU S 
myetoid/lymphoid or mixed-lineage leukem 
junctional adhesion molecule 2 
proteasome (prosome, macropaln) subunft, 
Homo sapiens mesenchymal stem cell prote 
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115819 
409124 
115895 
458073 
115962 
115967 
115974 
115985 
129254 
446730 
116095 
426856 
116210 
116213 
432645 
116265 
129334 
116274 
426002 
116331 
116333 
132994 
418538 
116391 
116394 
134531 
116417 
116429 
116439 
116459 
427505 
409633 
116541 
132557 
414964 
116571 
451522 
421919 
116643 
116661 
116715 
116729 
318709 
418999 
116773 
116780 
453884 
116819 
427278 
407833 
116844 
116845 
116892 
116925 
116981 
453133 
117031 
117034 
431129 
417861 
117280 
117344 
117422 
117475 
117487 
117937 
130207 
117549 
117683 
117710 
117791 
117822 
422544 
117895 
452259 
133057 



AA426573 

AA431418 

AA436182 

AA437099 

AA446585 

AA446887 

AA447224 

AA447709 

AA453624 

AA455044 

AA456045 

AA460454 

AA476494 

AA476738 

AA481422 

AA482595 

AA485084 

AA485431 

AA489638 

AA491000 

M491250 

AA505133 

AA598447 

AA599243 

AA599574 

AA600153 

AA609309 

AA609710 

AA610068 

AA621399 

AA621752 

C21523 

D12160 

D19708 

D25801 

D45652 

D602Q8 

D80504 

F03010 

F04247 

F10966 

F13700 

H05063 

H16758 

H17315 

H22566 

H48459 

H53073 

H56559 

H57957 

H64938 

H64973 

H69535 

H73110 

H81783 

H86259 

H88353 

H88639 

H88675 

H93708 

N22107 

N24046 

N27028 

N30205 

N30621 

N33258 

N33256 

N 33390 

N40180 

N45198 

N48325 

N48913 

N49394 

N50656 

N50721 

N53143 



AA486620 Hs.41135 
AW292809 Hs.507Z7 
AB033035 Hs.51965 
AA192669 Hs.45032 
AI636361 Hs.179520 
AI745379 Hs.42911 
BE513442 Hs.238944 
AA447709 HsJ>68115 
AA252468 Hs.1098 
BE384932 Hs.64313 
M043429 Hs.62618 
R19768 Hs.172788 
BE622792 Hs.172788 
AA292105 Hs.326740 
D14041 Hs.347340 
BE297412 Hs.55189 
AW157022 Hs.343551 
AI129767 Hs.182874 
BE514376 Hs.165998 
N41300 H3.71616 
AF155827 Hs.203963 
AA112748 Hs.279905 
BE244323 Hs.85951 
T86558 Hs.75113 
NM_006033Hs.65370 
AI742845 Hs.110713 
AW499664 

AF191018 Hs.279923 
AA251594 Hs.43913 
R80137 Hs.302738 
AA361562 Hs.178761 
AW449822 Hs.55200 
D12160 Hs.249212 
AA1 14926 Hs.169531 
AA337548 Hs.333402 
D45652 Hs.211604 
BE565817 Hs.26498 
AJ224901 Hs.109526 
AI367044 Hs.153638 
R61504 

AL1 17440 Hs.170263 
BE549407 Hs.1 15823 
R52576 Hs.285280 
NMJ)00121Hs.89548 
AI823410 Hs.343581 
H22566 Hs.63931 
AA355925 Hs.36232 
H53073 Hs.93698 
AL031428 Hs.174174 
AW955632 Hs.66666 
H64938 Hs.337434 
AA649530 Hs.348148 
AI573283 Hs.38458 



H73110 
N29218 



Hs.260603 
Hs.40290 



AC005757 Hs.31809 
H88353 Hs.347265 
U72209 
AL137751 
AA334551 
M18217 
R19085 
AI355562 
N30205 
N30621 



Hs.263671 

Hs.172129 

Hs.210706 

Hs.43880 

Ks.93740 

Hs.44203 



AF044209 Hs.144904 
AF044209 Hs.144904 
N33390 Hs.44483 
N40180 

N45198 Hs.47248 
N48325 Hs.93956 
AA706282 Hs.93963 
AB018259 Hs.118140 
AW450348 Hs.93996 
AA317439 Hs.28707 
AA465131 Hs.64001 



endomudn-2 

N-acetytgtuoosamlnidase, alpha- (SanfiO 

K1AA1209 protein 

ESTs 

hypothetical protein MGC10702 
ESTs 

hypothetical protein FU10631 

ESTs, Weakly similar to T08599 probable 

DKFZp434J1813 protein 

ESTs, Weakly similar to AF257182 1 G-pro 

ESTs 

ALEX3 protein 
ALEX3 protein 

hypothetical protein MGC10947 

H-2K binding factor-2 

hypothetical protein 

hypothetical protein FLJ22584 

guanine nucleotide binding protein (G pr 

PAI-1 mRNA-binding protein 

Homo sapiens mRNA; cDNA DKFZp586N1720 (f 

hypothetical protein FU10339 

clone HQ0310PRO0310p1 

exportin, tRNA (nuclear export receptor 

general transcription factor IIIA 

lipase, endothelial 

DEK oncogene (DNA binding) 

Human done 23826 mRNA sequence 

putative nucleotide binding protein, est 

P1BF1 gene product 

Homo sapiens cDNA: FU21425 fis, clone C 
26S proteasome-associated pad1 homoiog 
ESTs 

polymerase (RNA) III (DNA directed) (155 
ESTs 

hypothetical protein MGC12760 

gb:HUMGS02848 Human adult lung 3* direct 

hypothetical protein FU21657 

zinc finger protein 198 

myeloid/lymphoid or mixed-lineage leukem 

gb:yh16a03.s1 Soares infant brain 1NIB H 

tumor protein p53-binding protein, 1 

ribonuclease P, 40kD subunlt 

Homo sapiens cDNA: FLJ22096 fis, clone H 

erythropoietin receptor 

karydpherin alpha 1 (importin alpha 5) 

ESTs 

K1AA0186 gene product 
EST 

KIAA0601 protein 

ESTs, Weakly similar to S19560 proRne-r 
ESTs, Weakly similar to A46010 X-finked 
gb:ns44f05.s1 NCLCGAPJVIvl Homo sapiens 
ESTs 

ESTs, Moderately similar to A47582 B-cel 
ESTs 



gb:yw2la02.s1 Morton Fetal Cochlea Homo 

YY1 -associated factor 2 

Homo sapiens mRNA; cDNA DKFZp434l0812 (f 

sperm specific antigen 2 

Homo sapiens cDNA: FU21409 fts, clone C 

Homo sapiens cDNA RJ13182 fis, clone NT 

ESTs, WeaJdyslmflarto A46010 X-<fnked 

ESTs, Weakly similar to I38022 hypotheti 

ESTs 



nuclear receptor co-repressor 1 
EST 

gb:yy44dOZs1 Soares_multiple_scterosls„ 
ESTs, Highly similar to similar to Cdc14 
EST 
ESTs 

WAA0716gene product 
ESTs. Highly similarto SORLJiUMAN SORT1 
signal sequence receptor, gamma (translo 
Homo sapiens clone 25218 mRNA sequence 
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18103 N55326 AA401733 Hs.184134 

18111 N55493 N55493 

18129 N57493 N57493 

18278 N62955 N62955 Hs.316433 

18329 N63520 N63520 

18336 NB3604 BE327311 Hs.47166 

17098 N54166 AB017365 Hs.173859 

18363 N64168 AI183838 Hs.48938 

18364 N64191 N46114 Hs.29169 
8475 N66845 N66845 

8491 N67135 AV647908 Hs.90424 

18500 N67295 W32889 Hs.154329 

18584 N68963 AW136928 

56647 N69331 AI252640 Hs.110364 

18661 N70777 AL137554 Hs.49927 

18684 N71364 N71313 Hs.163986 

18689 N71545 AW390601 Hs.184544 

18690 N71571 N71571 Hs.269142 
18766 N74456 N74456 Hs.50499 
18793 N75594 N75594 Hs.285921 
18817 N79035 AI668658 Hs.50797 
18844 N80279 AL035364 Hs.50891 
18919 N91797 AW452696 Hs.130760 
29558 N92454 AW580922 Hs.180446 
07604 N94581 • AW191962 Hs.288061 
18996 N94746 N94746 Hs.274248 
19021 N98238 N98238 Hs.55185 
19039 R02384 AI160570 Hs.252097 
19063 R16833 R16833 Hs.53106 

332622 R41828 R10674 

19111 R43203 T02865 Hs.328321 

15115 R46395 AA214228 Hs.127751 

19146 R58863 R58863 Hs.91815 

449224 R78248 AW995911 Hs.299883 

19239 T11483 T11483 

19281 T16896 AI692322 Hs.65373 

19298 T23820 NMJ01 241 Hs. 155478 

26502 T30222 T10077 Hs.13453 

19983 W15275 W55956 . Hs.94030 

19558 W38194 W38194 

29641 W42414 AW081883 Hs.211578 

19445 W49632 AA884471 Hs.90449 

19650 W57613 R82342 Hs.79856 

19654 W57759 W57759 

19683 W61118 W65379 Hs.57835 

19694 W65344 AA041350 Hs.57847 

19718 W69216 W69216 Hs.92848 

10365 W69379 AI287518 

19938 W86728 AW014862 Hs.58885 

20128 Z38499 BE379320 Hs.91448 

20130 Z38630 AA045767 Hs.5300 

20148 Z39494 F02806 Hs.65765 

20155 Z39623 Z39623 Hs.65783 

,51979 Z40071 F06972 Hs.27372 

20183 Z40174 AW082866 Hs.65882 

20184 Z40182 Z401B2 Hs.65885 
20211 Z40904 Z40904 Hs.66012 
20245 AA166965 AW959615 Hs.111045 
20247 AA167500 AA167500 Hs.103939 
20254 AA169599 W90403 Hs.111054 

20259 AA171724 AW014786 Hs.192742 

20260 AA171739 AK000061 Hs.101590 
20275 AA177105 AA177105 Hs.78457 
20284 AA182626 AA179656 

17735 AA186324 AA188175 Hs.82506 

122137 AA192099 AJ236885 

20302 AA192173 AA837098 Hs.269933 

20303 AA192415 AI216292 Hs.96184 
20305 AA192553 AW295096 Hs.101337 
20319 AA194851 T57776 Hs.191094 
I08729 AA195520 AA195764 Hs.72639 
20326 AA196300 AA196300 Hs.21145 
33145 AA196549 H94227 Hs.6592 
203Z7 AA196721 AK000292 Hs.130732 
20328 AA196979 AA923278 Hs.290905 
20340 AA206B28 AA206828 



ESTs 

gb:yv50c02.s1 Soares fetal fiver spleen 
gb:yy54c08.s1 Soaresjmjltiple_sderosis_ 
Homo sapiens cDNA FU11375 fls, clone HE 
gbryy62f01.s1 SoaresjnuItipte_sderosls_ 
HT021 

frizzled (DrosophBa) homolog 7 
hypothetical protein FU21802 
hypothetical protein FU22623 
gb:za46c11.s1 Soares fetal liver spleen 
Homo sapiens cDNA: FU23285 fis, clone H 
ESTs 

gb:UW^l1-adp-d-08-0-Ul.s1 NCLCGAP.Su 

peptidytprolyl isomeraseC (cyclophiiin 

protein kinase NYD-SP15 

Homo sapiens cDNA: FLJ22765 fis, clone K 

Homo sapiens, clone IMAGE:3355383 ( mRNA. 

ESTs 

EST 

ESTs, Moderately similar to T47135 hypot 
ESTs 

hypothetical protein 

myosin phosphatase, target subunit 2 

karyopherin (importin) beta 1 

collagen, type VIII, alpha 2 

hypothetical protein FU20758 

ESTs 

pregnancy specific beta-1 -glycoprotein 6 
ESTs, Moderately similar to ALU1_HUMAN A 
CSR1 protein 
EST 

hypothetical protein 
ESTs 

hypothetical protein FU23399 
gb:CHR90049 Chromosome 9 exon Homo sapie 
ESTs, Weakly similar to T02345 hypotheti 
cyciin T2 

hypothetical protein FLJ14753 
Homo sapiens mRNA; cDNA DKFZp586E1624 (f 
Empirically selected from AFFX single pr 
Homo sapiens cDNA: FU23037 fis, done L 
Human clone 23908 mRNA sequence 
ESTs, Weakly simllarto S65657 alpha-1C- 
gb:zd20g11.s1 SoajesJetaLhearLNbHH19W 
ESTs 

ESTs, Moderately similar to ICE4 JWMAN C 
ESTs ■ 

Homo sapiens mRNA; cDNA DKFZp586D0923 (f 
ESTs 

MKP-1 Oke protein tyrosine phosphatase 
bladder cancer associated protein 
ESTs 
ESTs 

BMX non-receptor tyrosine kinase 

ESTs 

EST 

EST 

ESTs 

EST 

ESTs 

hypothetical protein FU12785 

hypothetical protein 

solute carrier family 25 (mitochondrial 

gb:zp54e1 1.s1 Stratagene NT2 neuronal pr 

K1AA1254 protein 

zinc finger protein 148 (pHZ^2) 

ESTs 

ESTs 

uncoupling protein 3 (mitochondrial, pro 

ESTs 

ESTs 

hypothetical protein RG083M05.2 

Homo sapiens, done !MAGE:2951368, mRNA, 

hypothetical protein FU20285 

ESTs, Weakly similar to protease [H.sapl 

gb:zq80b08.s1 Stratagene hNT neuron (937 
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417122 
131522 
421787 
120375 
120376 
120390 
410804 
434223 
312771 
120396 
120409 
120414 
120420 
120422 
419326 
120423 
120435 
120453 
120455 
120456 
120473 
128922 
120477 
120479 
120488 
120510 
120527 
120528 
120529 
120541 
120544 
120562 
120569 
120571 
'120572 
129434 
130529 
120575 
409339 
120591 
120593 
430275 
440303 
120609 
409702 
456870 
132614 
458750 
135376 
120621 
452279 
332484 
120644 
120660 
135370 
120661 
429828 
452291 
120699 
100690 
120726 
120737 
120745 
443574 
120750 
120761 
120768 
120769 
135232 
120793 
120796 
120809 
332633 
120825 
120827 



AA207123 
AA214539 
AA226914 
AA227260 
AA227469 
AA233122 
AA233334 
AA233347 
AA233714 
AA233796 
AA235050 
AA235704 
AA236031 
AA236352 
AA236390 
AA236453 
AA243370 
AA250947 
AA251083 
AA251113 
AA251973 
AA252023 
AA252414 
AA252650 
AA255523 
AA258128 
AA262105 
AA262107 
AA262235 
AA278298 
AA278721 
AA280036 
AA2B0648 
AA280738 
AA280794 
AA280837 
AA280886 
AA280934 
AA281535 
AA281797 
AA282047 
AA283002 
AA283709 
AA283902 
AA284108 
AA284109 
AA284371 
AA284744 
AA264784 
AA284840 
AA266844 
AA287032 
AA287038 
AA287546 
AA287553 
AA287556 
AA287564 
AA291015 
AA291716 
AA291749 
AA293656 
AA302430 
AA302809 
AA302820 
AA310499 
AA321890 
AA340589 
AA340622 
AA342457 
AA342864 
AA342973 
AA346495 
AA347573 
AA347614 
AA347717 
AA348913 



AI906291 

AI380040 

AA227068 

AF0287Q6 

AA227469 

AA837093 

U64820 

AI825842 

AA018515 

AA134006 

AA235050 

AW137156 

AI128114 

AL133097 

W94915 

AA236453 

AA243370 

AA250947 

AA251720 

AA488750 

AA251973 

AI244901 

AA252414 

AF006689 

AW952916 

AI796395 

AA262105 

AI923511 

AI434823 

W07318 

BE548277 

BE244580 

AA807544 

AB037744 

H39599 

AW967495 

AA178953 

AW978022 

AB020686 

AF078847 

AA748355 

Z11773 

AA306166 

AW978721 

AI752244 

AI241084 

AA284371 

AA1 15496 

BE617856 

AW961294 

AA286844 

AW172431 

AI869129 

AA286785 

BE622187 

AA287556 

AB019494 

AF015592 

AI683243 

AA383256 

AA293655 

AL049176 

AA302809 

U83993 

AI191410 

AA321890 

AA340589 

AI769467 

AL038812 

AA342864 

AI247356 

AA346495 

AL120071 

AI280215 

AA382525 

AA348913 



Hs.81234 
Hs.239489 
Hs.108301 
Hs.111227 

Hs.1 11460 

Hs.66521 

Hs.3776 

Hs.264482 

HS.79306 

Hs.181202 

Hs.112885 

Hs.301717 

Hs.42419 

Hs.18978 

Hs.96450 

Hs.170263 

Hs.104347 

Hs.88414 

Hs.269988 

Hs.9589 

Hs.43141 

Hs.110299 

Hs.63510 

Hs.111377 

Hs.4094 

Hs.104413 

Hs.104415 

Hs.240 

Hs.103104 

Hs.342307 

Hs.24970 

Hs.34892 

Hs.294008 

Hs.186644 

Hs.309648 

Hs.238911 

Hs.54037 

Hs.191356 

Hs.193522 

Hs.237786 

Hs.7145 

Hs.266076 

Hs.154353 

Hs.118064 

Hs.336898 

Hs.99756 

Hs.143818 

Hs.61260 

Hs.13012 

Hs.96616. 

Hs.99677 

Hs.99670 

Hs.263412 

Hs.225767 

Hs.28853 

Hs.97258 

Hs.1657 

Hs.21198 

Hs.82223 

Hs.321709 
Hs.96693 

Hs.104560 

Hs.9475 

Hs.96800 

Hs.96812 

Hs.96820 

Hs.48998 
Hs.96885 
Hs.132967 



Immunoglobulin superfamBy, member 3 
TIA1 cytotoxic granule-associated RNA-bi 
nuclear receptor subfamily 2, group C, m 
Zic family member 3 (odd-paired Drosophi 
gb:zr18a07.s1 Stratagene NT2 neuronal pr 
catdum/calmodulln-dependent protein kin 
Machado-Joseph disease (spinocerebellar 
zinc finger protein 216 

Homo sapiens mRNA; cONA DKFZp761A041 1 (f 
eukaryotlc translation Initiation factor 
gb:zs38e04.s1 Soares_NhHMPu_S1 Homosapi 
hypothetical protein FU10038 
spinal cord-derived growth factor-B 
hypothetical protein OKFZp434N1928 
ESTs 

Homo sapiens cDNA: FU22822 fis, clone K 
EST 

tumor protein p53-blnding protein, 1 
ESTs, Weakly similar to ALUC_HUMAN HI] 
BTB and CNC homology 1, basic leucine zl 
ESTs 
ubiquilin 1 

DKFZP727C091 protein 
mitogen-adivated protein kinase kinase 
K1AA0141 gene product 
ESTs 

Homo sapiens cDNA FU14208 fis, clone NT 

ESTs 

ESTs 

M-phase phosphoprotein 1 
ESTs 

hypothetical protein FU10330 

ESTs, Weakly similar to B34323 GTP-bindi 

KIAA1323 protein 

ESTs 

ESTs 

gb:zp39e03.s1 Stratagene muscle 937209 H 
hypothetical protein DKFZp762E1 51 1 ; KIAA 



general transcription factor 1IH, polype 
ESTs 
zinc finger protein 187 
calpain 7 

ESTs, Weakly similar to A46010 X-Rnked 

eukaryotic translation elongation factor 

nonselective sodium potassium/proton exc 

similar to rat nuclear ubiquitous casein 

Homo sapiens, Similar to RIKEN cDNA 1810 

mitochondrial ribosome recycling factor 

hypothetical protein FLJ23459 

hypoftietfcal protein FU13164 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to I38022 hypotheti 
ESTs, Weakly similar to ALUBJWMAN III ! 
1DN3 protein 

CDC7 (ceil division cycle 7, S. cerevfei 
ESTs, Moderately similar to S29539 ribos 
estrogen receptor 1 
ESTs 

chordirHike 

gbf ST10426 Adipose tissue, white I Homo 
purinergic receptor P2X, figand-gated io 
ESTs, Moderately similar to 2109260A B c 
branched chain keto acid dehydrogenase E 
EST 
ESTs 

ESTs, Moderately similar to ALU7_HUMAN A 

ESTs 

ESTs 

gb:EST52657 Fetal heart II Homo sapiens 
fibronectin leucine rich transmembrane p 
ESTs 

Human EST clone 122887 mariner transposo 
gb;EST55442 infant adrenal gland II Homo 



139 



WO 02/079492 



PCT/US02/04915 



120850 AA349647 AA349647 Hs.96927 

120852 AA349773 AA349773 Hs.191564 

128852 AA350541 R40622 Hs.106601 

135240 AA357159 AA357159 Hs.96986 

5 120870 AA357172 AA357172 Hs.292581 

120894 AA370132 AA370132 Hs.97063 

435737 AA370472 AF229839 Hs.1732Q2 

120897 AA370867 AA37Q867 Hs.97079 

120915 AA377298 AL135556 Hs.97104 

10 120935 AA383902 AL048409 Hs.97177 

12093S AA385934 AA385934 Hs.97184 

120937 AA386255 AA386255 Hs.97186 

120938 AA386260 AA386260 Hs.104632 
417632 AA386266 R20855 Hs.5422 

15 120960 AA398014 AA398014 Hs.104684 

120985 AA398222 AI219896 Hs.97592 

120988 AA398235 AA398235 Hs.97631 

121008 AA398348 AA398348 Hs.130546 

121029 AA398482 AA398482 Hs.97641 

20 121032 AA398504 AA393037 Hs.161798 

121033 AA398505 AA398505 Hs.97360 

121034 AA398507 AL389951 Hs.271623 

121035 AA398523 AA398523 Hs.210579 
121058 AA398625 AA398625 Hs.97391 

25 121060 AA398632 AA398632 Hs.97395 

121061 AA398633 AA393288 Hs.97396 

121091 AA398894 AA398894 Hs.97657 

121092 AA398895 AA398895 Hs.97658 
121094 AA398900 AA402505 

30 121096 AA398904 AA398904 Hs.332690 

121115 AA399122 AA398187 Hs.104682 

121121 AA399371 AA399371 Hs.189095 

121122 AA399373 AI126713 Hs.192233 
121125 AA399441 AL042981 Hs.251278 

35 121151 AA399636 AA399636 Hs.143629 

121153 AA399640 AA399640 Hs.97694 

121163 AA399680 AI676062 Hs.1 11902 

121176 AA400080 AL121523 Hs.97774 

121192 AA400262 AA400262 Hs.190093 

40 121223 AA400725 AI002110 Hs.97169 

121227 AA400748 AA400748 Hs.97823 

121231 AA400780 AA814948 Hs.96343 

121278 AA401631 AA037121 Hs.98518 

121279 AA401688 AA292873 Hs.177996 
45 121282 AA401695 AA401695 Hs.97334 

.121299 AA402227 AA402227 Hs.22826 

121301 AA402329 NMJ06202HS.89901 

121302 AA402398 AA402587 Hs.325520 
121304 AA402449 fc AA293863 Hs.97316 

50 . 121305 AA402468 AA402468 Hs.291557 

134721 AA403268 . AK000112 Hs.89306 

121323 AA403314 AA291411 Hs.97247 

121324 AA404229 AA404229 Hs.97842 

. 444422 AA404260 AI768623 Hs.108264 

55 131074 AA404271 U16125 Hs.181581 

• - 121344 AA405026 AA405026 Hs.193754 

. 121348 AA405182 AA405182 Hs.97973 

121350 AA405237 AA405237 

121400 AA406061 AA406061 Hs.98001 

60 121402 AA406063 AA406063 Hs.98003 

121403 AA406070 AA406070 Hs.98004 

121408 AA406137 AA406137 Hs.98019 

121431 AA406335 AA035279 Hs.176731 

121471 AA411804 AA411804 Hs.261575 

65 121474 AA411833 AA402335 Hs.188760 

121526 AA412219 AW665325 Hs.98120 

121530 AA412259 AA778658 Hs.98122 

121558 M412497 AA412497 

121559 AA412498 AI192044 Hs.104778 
70 121584 AA416586 AI024471 Hs.98232 

121609 AA416867 AA416867 Hs.98185 

121612 AA416874 AA416874 Hs.98168 

121737 AA421133 AA421133 Hs.104671 

121740 AA421138 AA421138 Hs.143835 

75 436032 AA422079 AA150797 Hs.109276 

121784 AA423837 T90789 Hs.94308 



Homo sapiens cDNA FU12573 fis, done NT 

ESTs 

ESTs 

EST 

ESTs, Moderately similar to AW1 .HUMAN A 
ESTs 

l-kappa-B-interactlng Ras-like protein 1 
ESTs, Moderately similar to AF174605 1 F 
ESTs 

ESTs, Weakly similar to ALU1J1UMAN ALU S 
EST, Highly similar to (defllne not aval 
EST 
EST 

glycoprotein M6B 

EST 

ESTs 

ESTs 

Human DNA sequence from clone RP1 1-251 J8 

EST 

ESTs 

ESTs 

nucleoporin 50kD 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Moderately similar to ALU8_HUMAN A 
EST 

gb:zt62h10.rl Soares_testis_NHT Homo sap 
ESTs 

ESTs, Weakly similar to mitochondrial cl 

similar to SALL1 (sal (Droscphila)-iike 

ESTs, Highly similar to T00337 hypotheti 

KIAA1201 protein 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to dJ667H12.2.1 [H. 
Homo sapiens mRNA; cDNA DKFZp434D024 (fr 
ESTs. Weakly similar to ALUC.HUMAN 111! 
Homo sapiens cDNA FU11490 fis, clone HE 
ESTs 
ESTs 

tropomodulin 3 (ubiquitous) 

phosphodiesterase 4A, cAMP-specific (dun 

LAT1-3TM protein 

EST 

ESTs 

hypothetical protein FU20105 

ESTs 

EST 

ESTs 

glutamate receptor, tonotroplc, kainate 

ESTs 

ESTs 

gb:zt06e10.s1 NCLCGAP.GCB1 Homo sapiens 

EST 

ESTs 

EST 

EST 

ESTs 

ESTs 

ESTs, Highly similar to Trad [Ksapiens] 

ESTs 

ESTs 

gbzt95g12^1 SoaresJestisJIHT Homo sap 

ESTs 

ESTs 

EST 

ESTs 

erythrocyte transmembrane protein 
EST 

latexin protein 

RAB35, member RAS oncogene family 
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121802 AA424328 

121803 AA424339 
135286 AA424469 
332778 AA424469 
121806 AA424502 
129517 AA425004 
121845 AA425734 
121853 AA425887 
121891 AA426456 
121895 AA427396 
121899 AA427555 

121917 AA428218 

121918 AA428242 

121919 AA428281 

121941 AA428865 

121942 AA428994 
121970 AA429666 
121993 AA430181 
418706 AA430184 
122022 AA431293 

122050 AA431478 

122051 AA431492 
122055 AA431732 
122105 AA432278 

• 122125 AA434411 
135235 AA435512 
122162 AA435698 
422072 AA435711 
415106 AA435815 
122186 AA435842 
122235 AA436475 
412970 AA436489 
419288 AA442060 
122310 AA442079 
122334 AA443151 
122382 AA446133 
122425 AA447145 
122431 AA447398 
122450 AA447643 
426284 AA447742 
122477 AA448226 
122500 AA448825 
122522 AA449444 
122536 AA450087 
122538 AA450211 
122540 AA450244 
122560 AA452123 
421919 AA452155 
122562 AA452156 
122585 AA453036 
122608 AA453526 

122635 AA454085 

122636 AA454103 
122653 AA454642 
122660 AA454935 
122703 AA456323 
122724 AA457395 
122749 AA458850 
122772 AA459662 
430242 AA459668 
429838 AA459679 
122777 AA459702 
135362 AA460017 
122798 AA460324 
122837 AA461509 

122860 AA464414 

122861 AA464428 
122910 AA470084 
132899 AA476606 
122967 AA478521 
422845 AA478523 
123009 AA479949 
128917 AA481252 
123081 AA485351 
123133 AA487264 
123184 AA489072 



AI251870 Hs.1 88898 
AI338371 Hs.157173 
AW023482 Hs.97849 
AW023482 Hs.97849 
AA424313 Hs.98402 
AW972853 Hs.1 12237 
AI732692 Hs.165066 
AA425887 Hs.98502 
AA426456 Hs.98469 
AA427396 

R55341 Hs.50421 
AA406397 Hs.139425 
BE274689 Hs.184175 
AA428281 Hs.98560 
AA428865 Hs.98563 
AW452701 Hs.293237 
AA429666 Hs.98617 
AW297880 Hs.98661 
U73524 Hs.87465 
AA431293 Hs.98716 
AI453076 

AA431492 Hs.98742 
AA431732 Hs.9B747 
AW241685 Hs.98699 
AK000492 Hs.98806 
AW298244 Hs.266195 
AA628233 Hs.79946 
AB018255 Hs.111138 
U40763 HS77965 
AA398811 Hs.104673 
AA436475 Hs.1 12227 
AB026436 Hs.177534 
AA256106 Hs.87507 
AW192803 Hs.98974 
BE465894 Hs.98365 
AA446440 Hs.98643 
AB007859 Hs.100955 
AA447398 Hs.99104 
M447643 Hs.1 12095 
AJ404468 Hs.284259 
AA448226 Hs.324123 
AA448825 Hs.99190 
AA299607 Hs.98969 
AF060877 Hs.99236 
AA450211 Hs.99239 
M476741 Hs.98279 
AW392342 Hs.283077 
AJ224901 Hs.109526 
AA452156 

AI6B1654 Hs.170737 
AA453525 Hs.143077 
AA454085 

AW651706 Hs.99519 
AW009166 Hs.99376 
A1816827 Hs.180069 
AA456323 Hs.269369 
AA457395 Hs.99457 
AA458850 Hs.293372 
AW1 17452 Hs.99489 
U66669 Hs.236642 
AW904907 Hs.30732 
AK001022 Hs.214397 
AA978128 Hs.99513 
AW366286 Hs.145696 
AM61509 Hs.293565 
AA464414 

AA335721 Hs.213628 
AA470084 Hs.98358 
AA476606 Hs.59666 
AA806187 Hs.289101 
AA317841 Hs.7845 
AA535244 Hs.78305 
AI365215 Hs.206097 
A1815486. Hs.243901 
AA487264 Hs.154974 
BE247767 Hs.18166 



ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

ESTs, Moderately similar to ALU2_HUMAN A 

hypothetical protein FU14303 

ESTs 

gb:zw33a02.s1 Soares ovary tumor NbHOTH 

K1AA0203 gene product 

ESTs 

chromosome 2 open reading frame 3 

EST 

ESTs 

ESTs 

EST 

ESTs 

ATP/GTP-binding protein 

ESTs, Moderately similar to T42650 hypot 

ELAV (embryonic lethal, abnormal vision, 

EST 

EST 

ESTs 

hypothetical protein 
ESTs 

cytochrome P450, subfamily XIX (aromatiz 
KIAA0712 gene product 
peptidyl-prolyl isomerase G (cydophiSn 
ESTs 

membrane-associated nucleic acid binding 
dual specificity phosphatase 10 
ESTs 

ESTs, Weakly similar to S65824 reverse t 
ESTs, Weakly similar to LB4D.HUMAN NADP- 
ESTs 

KIAA0399 protein 
ESTs 

hypothetical protein DKFZp434F1819 

dyneln, axonemal, heavy polypeptide 9 

ESTs 

ESTs 

ESTs 

regulator of G-protefn signalling 20 
ESTs 

ESTs, Weakly similar to A43932 mucin 2 p 
centrosomal P4.1 -associated protein; unc 
zinc finger protein 196 

gb:zx29c03.s1 SoaresJotaLfetus_Nb2HF8_ 

hypothetical protein FU23251 

ESTs 

gb:zx33a0B.s1 Soares_totaLfetus_Nb2HF8_ 

hypothetical protein FLJ 14007 

ESTs 

nuclear respiratory factor 1 

ESTs 

ESTs 

ESTs, Weakly similar to B34087 hypotheti 
ESTs 

3-hydroxyisobutyryi-Coenzyme A hydrolase 

hypothetical protein FU13409; KIAA1711 

hypothetical protein FU 10160 similar to 

ESTs, Weakly similar to T17454 diaphanou 

splicing factor (CC1 .3) 

ESTs, Weakly similar to putative p1 50 [H 

gb:zx78g01 .si Soares ovary tumor NbHOT H 

ESTs 

ESTs 

SMAD In the antisensB orientation 
glucose regulated protein, 5BkD 
hypothetical protein MGC2752 
RAB2, member RAS oncogene family 
oncogene TC21 

Homo sapiens cONA FU20738 fis, clone HE 
Homo sapiens mRNA; cONA DKFZp667N064 (fir 
K1AA0870 protein 
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332467 AA489630 

123233 AA490225 

123234 AA490227 
123236 AA490255 
123255 AA49G890 
430015 AA490916 
448892 AA490925 
123259 AA490955 
123284 AA495812 
123286 AA495824 
123315 AA496369 
457397 AA504125 
433049 AA521473 
123421 AA598440 
123449 AA598899 
426981 AA599244 
409986 AA599694 
123497 AA600037 
123604 AA609135 
123712 AA609584 
123731 AA609839 
123800 AA620423 
123841 AA620747 
123929 AA621364 
123978 C20653 
133184 D20085 
132835 D20749 
435147 D51285 
128695 D59972 
124029 F04112 
124057 F13604 
449316 H01662 
130973 H05135 
124106 H12245 
124136 H22842 
124165 H30894 
429627 H43442 
124178 H45996 
129948 H69281 
452114 H69485 
124+0826254 
129056 H70627 
427580 H73260 
426793 H77531 
124274 H80552 
129078 H80737 
457658 H93412 
124315 H94892 
437712 H95643 
124324 H96552 
452933 H97146 
132231 H99131 
421877 H99462 
443123 H99837 
132963 N22140 
420473 N22197 
417381 N23756 
130365 N24134 
456610 N24195 
439311 N26739 
124383 N27098 
124387 N27637 
129341 N3309Q 
419793 N35967 
124433 N39069 
124441 N46441 
132338 N48270 
436575 N48365 
124466 N51316 
408048 N51499 

124483 N53976 

124484 N54157 

124485 N54300 
124494 N54831 
129200 N59849 
124527 N62132 



NNL014700HS.1 19004 
AW974175 Hs.151875 
NM-001938HS.16697 
AW968504 Hs.123073 
AA830335 Hs.105273 
AW768399 Hs.106357 
AF084535 Hs.22464 
AI744152 Hs.283374 
AA488988 Hs.293796 
AA495824 Hs.188822 
AA496369 

AW969025 Hs.109154 
AU076668 Hs.334884 
AA598440 Hs.291154 
AL049325 Hs.1 12493 
AL044675 Hs.173081 
NM_014777Hs.57730 
AA765256 Hs.135191 
AA609135 Hs.293076 
AA609664 

AA609839 Hs.334437 
AA620423 Hs.1 12862 
AA620747 Hs.1 12896 
AA621364 Hs.112981 
T89832 Hs.170278 
AA001021 Hs.6685 
Z83844 Hs.5790 
AL133731 Hs.4774 
NM_003478Hs.101299 
F04112 Hs.312553 
AA902384 Hs.73853 
AI609045 Hs.321775 
AI638418 Hs.1440 
H12245 

H22842 Hs.101770 
H30039 Hs.107674 
NM_015340Hs.2450 
BE463721 Hs.97101 
AI537162 Hs.263988 
N22687 Hs.8236 
H69899 H69899 
AI769958 Hs.108336 
AK001507 Hs.44143 
X89887 Hs.172350 
H80552 Hs.102249 
AI351010 Hs.102267 
AW952124 Hs.13094 
NMJ05402HS.288757 
X04588 Hs.85844 
H96552 Hs.159472 
AW391423 Hs.288555 
AA662910 Hs.42635 
AW250380 Hs.109059 
AA094538 Hs.272808 
AA099693 Hs.34851 
AL1 18782 Hs.300208 
AF164142 Hs.82042 
W56119 Hs.155103 
AF172066 Hs.106346 
BE270668 Hs.1 51945 
N27098 Hs.102463 
N27637 Hs.109019 
AI193519 Hs.226396 
AI364933 Hs.1 68913 
AA280319 Hs.288840 
AW450481 Hs.161333 
AA353868 Hs.182982 
AI473114 

R10084 Hs.1 13319 
NM_007203Hs,42322 
AI821780 Hs.179864 
H66118 Hs.285520 
AB040933 Hs.15420 
N54831 Hs.271381 
N59849 Hs.13565 
N79264 Hs.269104 



KIAA0665 gene product 

ESTs, Weakly similar to MAPB^HUMAN MICRO 

down-regulator of transcription 1, TBP-b 

CDC2-related protein kinase 7 

ESTs 

ESTs 

epilepsy, progressive myoclonus type 2, 
ESTs, Weakly similar to CA15_HUMAN COliA 
ESTs 

ESTs, Weakly similar to A46010 X-finked 
gb:zv37d1 0.s1 Soares ovary tumor NbHOT H 
ESTs 

SEC10 (S. cerevisiae)-like 1 

EST, Weakly similar to I38022 hypothetic 

Homo sapiens mRNA; cDNA DKFZp564D036 (fr 

KIAA0530 protein 

KIAA0133 gene product 

ESTs, Weakly similar to unnamed protein 

ESTs 

Homo sapiens cDNA: FU21543 fe, clone C 

gb:ae62f01.s1 Stratagene lung carcinoma 

EST 

ESTs 

ESTs 

ESTs 

thyroid hormone receptor interactor 8 

hypothetical protein (JJ37E16.5 

Homo sapiens mRNA; cDNA DKFZp761C1712 (f 

cuIIinS 

gb:HSC2JH062 normalized infant brain cDN 
bone morphogenetic protein 2 
hypothetical protein DKFZp434D1428 
DEAD/H (Asp-GIu-AlanAsp/His} box polypep 
gb:ym17a12.rl Soares infant brain 1NIBH 
EST 
ESTs 

leucyl-tRNA synthetase, mitochondrial 
putative G protein-coupled receptor 
ESTs 
ESTs 

gb:yu70c12.s1 Webmann Olfactory Epithel 
ESTs, Weakly similar to ALUE_HUMAN lit! 
Homo sapiens clone FLB6914 PR01821 mRNA, 
HIR (histone cell cycle regulation defec 
EST 

lysosomal 

presenHins associated rhomboid-like pro 
v-ral simian leukemia viral oncogene horn 
neurotrophic tyrosine kinase, receptor, 
Homo sapiens cDNA: FLJ22224 fis, clone H 
Homo sapiens cDNA: FU22425 fis, clone H 
hypothetical protein DKFZp434K2435 
mitochondrial ribosomal protein L12 
putative transcription regulation nuclea 
epsilon-tubulin 

Sec23-interadng protein p125 
solute carrier family 23 (nucleobase tra 
eukaryotic translation initiation factor 
retinolc acid repressible protein 
mitochondrial ribosomal protein L43 
EST 
ESTs 

hypothetical protein FU11126 

serine/threonine kinase 24 (Ste20, yeast 

PR01 575 protein 

ESTs 

golgin-67 

ESTs 

kinesin heavy chain member 2 
A kinase (PRKA) anchor protein 2 
ESTs 

ESTs, Weakly similar to 2109260A B cell 
K1AA1500 protein 

ESTs, Weakly similar to 138022 hypothetl 
Sam68-Gke phosphotyrosine protein, T-ST 
ESTs 
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194R79 NR977R 


N62375 


Hs.102731 


177917 MR717R 
lOO^IO IMDOIOO 


AA903424 


Hs.6786 


194570 MR7179 


D54120 


Hs.146409 


19Q10R KIM707 
l^yiSD NOO/ 07 


BE296313 Hs.265592 


194T7R mrrira 

IdHDlD NOOlOO 


N68168 




194R7R MRP 901 


N68201 




194T77 MRR7rtfl 


N68300 


Hs.138485 


1 94R7R NRR791 
ItHO/O INDOOR I 


N68321 


Hs.231500 


194 SQA KIRQR7R 
I£*t090 r\l030/0 


N69575 


Hs.102788 


19ftRfM M7RnfY7 
HOOUl NfOUU/ 


AL133572 


Hs.199009 


779474 M7RR49 


AI680737 


Hs.289068 


1 9Q470 KIQfMRR 

i£OHfO INSUUQO 


T78277 


Hs.100293 


19RK7Q MQ194R 


AW582962 Hs.102897 


1 94RR9 MQ97R1 


W19407 


Hs.3862 


177117 WQ7914 


AB002316 


Hs.65746 


19AR71 MQQ14R 


AK001357 


Hs.102951 


1 77AR4 RJY7A7R 


AA464836 


Hs.291079 


/nocc dihqrc 
4^0^00 KluoOO 


J00077 


Hs.155421 


124/2U K11UOO 


R05283 




124/42 rslWOO 


T97733 


Hs.1 85685 


.120944 Kzjyou 


AL137586 


Hs.52763 


179QRK D9CKRQ 

loiyoo ruoDoy 


AI248173 


Hs.191460 


49RR04 R775AR 
4400U 1 * t\0 f 000 


AW162919 Hs.170160 


47RR9A R77R17 
HOOolo AO/ 0 1 0 


AL134275 


Hs.6434 


194757 R7fi7QA 
i&Hio/ rcoooso 


H11368 


Hs.141055 


1947R9 R7Q17Q 


AA553722 


Hs.92096 


194777 R4/1Q97 


R45154 


Hs.338439 


17R9RR R4117Q 

iod4od rvnirs 


R41179 


Hs.97393 


497QR1 R419Q4 


AW293165 Hs.143134 


Di!97n7 
4140UO K420U/ 


NIVL004427HS.165263 


12004U K4oio9 


AW297929 Hs.328317 


124785 R4ooU0 


W38537 


Hs.280740 


"10X700 D^/nK7 

124/92 K44J0/ 


R44357 


Hs.48712 


124793 R44019 


R44519 




124/99 K40UO0 


R45088 




10XQ49 DA7QAQ 
12401 £ ^4/940 


R47948 


Hs.188732 


194P91 DR1R94 
144041 KOI 044 


H87832 


Hs.7388 


494197 RR4Q5n 


AW966158 Hs.58582 


194A7R R5R941 
14*1000 K0044I 


R55241 


Hs.101214 


194A4R RRQ5AR 

144040 noyooo 


R59585 


Hs.101255. 


194A47 RRDD44 
1 4409/ r\0UU44 


W07701 


Hs.304177 


44DR7H RRHR79 
44U0 0\J KOUO/ £ 


BE561430 Hs.239388 


194AA1 DRRRQfi 
1 24001 KDDDyU 


R67567 


Hs.107110 


779RH7 RR79RR 
0J40U0 KO/400 


NM.004455HS.150956 


10/Q70 D7QCQQ 

124079 K/O0O0 


R73588 


Hs.101533 


124032 K794U0 


AI970003 


Hs.23756 


1249UO Ro7o4/ 


H75964 


Hs.107815 


124922 R9oo22 


R93622 


Hs.12163 


12494U Ryyayy 


AF068846 


Hs.103804 


124941 R99012 


AI766661 


Hs.27774 


124940 TU2000 


AW963279 Hs.123373 


12494/ tuoi/u 


T03170 


Hs.100165 


i44yo4 niroo 


AW964237 Hs.6728 


/1E&RR9 T1R41R 


U55184 


Hs.1 54145 


41f\RR7 T155Q7 
*t IUOOO 1 1009/ 


BE383768 


Hs.65238 . 


410 100 llODO^ 


R43504 


Hs.6181 


44fini4 T1RRQA 


AW960782 Hs.6856 


171DA9 T9RR44 
1J1U0& I^OO^f 


AI091121 


Hs.246218 




T40841 


Hs.98681 


10XQQ4 T/7RCR 

124ytt4 l 4/ 000 


BE313210 Hs.334798 


124991 lOUllO 


T50116 




/C700*) TCft-iAC 

To0145 


NM^004477Hs.203772 


lOCfiftA TCQR1C 


T58615 


Hs.235887 


1 o29o2 1 oyy 4u 


AW1 18826 Hs.6093 


AAA AAA TR7RQR 


AK002126 


Hs.1 1260 


12oUUo l04o91 


T91251 




19RflnO TR4094 


T64924 


Hs.303046 


440 Jo*t 1 OH300 


T79136 


Hs.127243 


195017 T68B75 


T68875 




125018 T69027 


T69027 


Hs.269481 


125020 T69924 


T69981 




437871 T70353 


AI084813 


Hs.1 14088 


134204 T79780 


AI873257 


Hs.7994 


125050 T79951 


AW970209 Hs.1 11805 


125052 T80174 


T85104 


Hs.222779 


125054 T80622 


T80622 


Hs.268601 



EST 
ESTs 

ceD division cycle 42 (GTP-blnding prot 
ESTs, Weakly similar to 138022 hypotheti 
gb:za11c0ts1 Soams fetal liver spleen 
ESTs, Weakly slmnarto 138022 hypotheti 
gb:za1 2g07.s1 Soares fetal Hver spleen 
EST 
ESTs 

protein containing CXXC domain 2 
Homo sapiens cDNA FU1 1918 fi3; clone HE 
0-llnked N-acety^hioosamine (GlcNAc) tr 
CGl-47 protein 

regulator of nonsense transcripts 2; DKF 
K1AA0318 protein 

Homo sapiens cDNA RJ10495 fis, clone NT 
ESTs, Weakly similar to T27173 hypotheti 
alpha-fetoprotein 

gb.ye91c08.s1 Soares fetal Cver spleen ' 
ESTs 

anaphase-promoting complex subunit 7 

hypothetical protein MGC12936 

RAB2, member RAS oncogene family-like 

hypothetical protein DKFZp76iF2D14 

Homo sapiens done 23758 mRNA sequence 

ESTs, Moderately similar to A46010 X-Bn 

ESTs 

KIAA0328 protein 
ESTs 

early development regulator 2 (homotog 0 
EST 

hypothetical protein MGC3040 
hypothetical protein FU20736 
gb:yg24h04,s1 Soares infant brain 1N1B H 
gb7g38g04.s1 Soares infant brain 1N1B H 
ESTs 

kelch (Drosophita)-Iike 3 

Homo sapiens cDNA FLJ12789 fis, clone NT 

EST 

ESTs 

Homo sapiens clone FLB8503 PR02286 mRNA, 
Human DNA sequence from clone RP1-304B14 
ESTs 

exostoses (muitiple)-Oke 1 
ESTs 

hypothetical protein similar to swine ac 
ESTs 

eukaryottc translation initiation factor 
heterogeneous nuclear nbonucleoprotein 
ESTs, Highly similar to AF1 61349 1 HSPCO 
ESTs, Weakly similar to ALU1_HUMAN ALU S 
ESTs 

KIAA1548 protein 

hypothetical protein FU11585 

95 kDa retinoblastoma protein binding pr 

ESTs 

ash2 (absent, small, or homeotic, Drosop 
Homo sapiens cDNA: FU21781 fis, done H 
ESTs 

eukaryotic translation elongation factor 
gb7b77c10^1 Stratagene ovary (937217) 
FSHD region gene 1 
ESTs 

Homo sapiens cDNA: FU22783 fis, clone K 
hypothetical protein FU1 1264 
gb:yd60a10.s1 Soares fetal liver spleen 
ESTs 

Homo sapiens mRNA for KIAA1724 protein, 
gbyc30f05.s1 Stratagene fiver (937224) 
sex comb on midieg homolog 1 
gbyc19d03 n Stratagene lung (937210) H 
ESTs 

hypothetical protein FU20551 
ESTs 

ESTs, Moderately similar to similar to N 
ESTs, Weakly similar to envelope [H^api 
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125083 T85352 
125064 T85373 
125066 T86284 
416507 T89579 
125080 T90360 
125097 T94328 
125104 T95590 
135107 T97257 
423122 T97599 
125118 T97620 
125120 T97775 
134160 T98152 
125136 W31479 
125144 W37999 
125150 W38240 
450142 W40150 
131987 W45435 
125178 W58202 
125180 W58344 
125182 W58650 
446888 W68736 
125197 W69106 
133497 W69111 
429922 W69399 
129232 W69459 
422166 W72424 
125209 W72724 
125212 W72834 
456631 W73955 
125223 W74701 
125225 W76540 
125228 W79397 
132393 W85888 
125238 W86038 
125247 W86881 
129296 W87804 
125263 W88942 
125266 W90022 
450862 W92272 
452401 W92764 
428243 W93040 

125277 W93227 

125278 W93523 
125280 W93659 
448205 W94003 
131844 W94401 
125284 W94688 
417111 W94787 
445424 238294 
125289 Z38311 
446313 Z38465 
431342 238525 
433227. Z38538 
428306 Z38551 
424624 238783 
125295 239113 
125298 239255 
125300 239591 
448378 239783 
444582 239920 
130862 240166 
128888 240388 
125310 240646 
125315 241697 
125317 299349 
135096 299394 



T85352 
T85373 
T86284 

AL045364 Hs.79353 
T90360 Hs.268620 
AW576389 Hs.335774 
T95590 

T97257 Hs.94560 
AA845462 Hs.124024 
R10606 Hs.269890 
T97775 Hs.100717 
T98152 Hs.79432 
AW962364 Hs.129051 
AB037742 Hs.24336 
W38240 

AW207469 Hs.24485 
AW453069 Hs.3657 
W93127 Hs.31845 
W58469 Hs.103120 
AA451755 Hs.263560 
AL030996 Hs.16411 
AF086270 Hs.278554 
BE617303 Hs.74266 
297630 Hs.226117 
R98881 Hs.109655 
W72424 Hs.1 12405 
W72724 Hs.103174 
AA746225 Hs.103173 
BE383436 Hs.108847 
AI916269 Hs.109057 
W74169 Hs.16492 
AA033982 Hs.1 10059 
AL135094 Hs.47334 
N99713 Hs.109514 
AA694191 Hs.163914 
AI051967 Hs.1 10122 
AA098878 

W90022 Hs.1 86809 
U91543 Hs.25601 
NMJ07115HS.29352 
H05317 Hs.283549 
W93227 Hs.1 03245 
A1218439 Hs.129998 
AI123705 Hs.106932 
W93949 Hs.33245 
AI419294 Hs.324342 
NM_002666Hs,103253 
AW016321 Hs.82306 
AB028945 Hs.12696 
T34530 Hs.4210 
H05245 Hs.106801 
AW971018 Hs.21659 
AB040923 Hs.106808 
AB037715 Hs.183639 
AB032947 Hs.151301 
AB022317 Hs.25887 
AW972542 Hs.289008 
239591 Hs.101376 
BE622770 Hs.264915 
R55344 Hs.22142 
AA497044 Hs.20887 
AI760853 Hs.241558 
R59161 Hs.124953 
R38110 Hs.106296 
299348 Hs.112461 
AA081258 



gbyd82d0U1 Soares fetal liver spleen 
gbryd82f07.s1 Soares fetal Over spleen 
gbyd77b07.s1 Soares fetal liver spleen 
transcription factor Dp- 1 
ESTs, Highly similar to ALU6.HUMAN ALU S 
EST, Moderately similar to S65657 alpha- 
gb.-ye40a03.s1 Soares fetal liver spleen 
ESTs, Moderately similar to I38022 hypot 
deltex (Drosophiia) homolog 1 
gb:yf35f1 1 $\ Soares fetal Over spleen 
EST 

fibrillin 2 (congenital contracture! ana 
ESTs 

KIAA1321 protein 

Empirically selected from AFFX single pr 

chondrojtin sulfate proteoglycan 6 (bama 

activity-dependent neuroprotective prate 

ESTs 

ESTs 

ESTs 

hypothetical protein LOC57187 

heterochromatin-like protein 1 

hypothetical protein MGC4251 

H1 histone family, member 0 

sex comb on midleg (Drosophila)-like 1 

S100 calcium-binding protein A9 (calgran 

ESTs, Weakly similar to TSP2_HUMAN THROM 

ESTs 

hypothetical protein MGC2749 

ESTs, Weakly similar to ALU5_HUMAN ALU S 

DKF2P564G2022 protein 

ESTs, Weakly similar to 138022 hypotheti 

hypothetical protein FLJ 14495 

ESTs 

ESTs 

ESTs 

gb:zn45g10.r1 Stratagene HeLa cell s3 93 
ESTs, Highly similar to LCT2.HUMAN LEUKO 
chromodomain hencase DNA binding protei 
tumor necrosis factor, alpha-induced pro 
ESTs 
EST 

enhancer of polycombl 

ESTs 

ESTs 

ESTs 

periDpin 

destrin (actin depolymerizing factor) 
cortactin SH3 domain-binding protein 
Homo sapiens cDNA FU13069 fis, clone NT 
ESTs, Weakly similarto PC4259 ferritin 
ESTs 

keich (Drosophiia)-like 1 
hypothetical protein RJ10210 
Ca2-Kjependent activator protein for seer 
sema domain, immunoglobulin domain (Ig), 
Homo sapiens cDNA: FU21814 fis, clone H 
EST 

Homo sapiens cDNA FU12908 fis, clone NT 

cytochrome b5 reductase b5R.2 

hypothetical protein FU10392 

ariadne (Drosophiia) homolog 2 

ESTs 

ESTs 

ESTs, Weakly similarto I38022 hypotheti 
zinc finger protein 36 (KOX 18) 
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TABLE 3 A 



Table 3A shows the accession numbers for those pkeys lacking unigenelD's for Table 3. Tne pkeys In Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene duster number from which the oligonucleotides were designed. Gene dusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were dustered based on sequence similarity using Clustering and 
Alignment Tools (DoubieTwist, Oakland California). The Genbank accession numbers for sequences comprising each duster are listed in the "Accession" 
column. 



Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accession 



108469 1 16761J AA079487 AA128547 M128291 AA079587 AA079600 
124106 1 25446.1 H12245 AA094769 R14576 
108501 13684.-12 AA083256 r 

108562 36375J AA100796 AF020589 AA074629 AA075946 M100849 AA085347 AA126309 AA079311 AA079323 AA085274 

. 101300 4669.1 BE535511 M62098 AA306787 AW891766 AA348998 AA338869 AA344013 AW956561 AW389343 AW403607 L40391 

AW408435 AA121738 AI568978 H13317 R20373 AW948724 AW948744 AA335023 AA436722 AA448690 C21404 
AW884390 AA345454 AA303292 AA1 74174 BE092290 T90614 AA035104 R76028 AA126924 AA741086 AW022056 
AW1 18940 AA121666 AI832409 AA683475 AI140901 AI623576 AW519064 AW474125 AI953923 AI735349 AW150109 
AI4361 54 AW1 18130 AW270782 AI804O73 N27434 AA876543 AA937815 A1051 166 AA505378 AI041975 AI335355 
AI089540 AA662243 AI127912 A1925604 AI250880 AI366874 A1564386 AI815196 AI683526 AI435885 AI160934 H79030 
AI801493 AA448691 AI673767 AI076042 A1804327 AA813438 AA680002 AI274492 T16177 AI287337 AI935050 
AA907805 AA91 1493 AI589411 AI371358 AW576236 AI078866 AW516168 AA346372 AI560185 AA471009 R75857 
AA296025 AA5231 55 AA853168 A1696593 A1658482 AI566601 AW072797 AA128047 AA035502 AW243274 AA992517 
R43760 

132091 94851.1 AW954243 AA829930 AA412478 AA828434 AA814538 AI927418 AI192435 W52897 AA443666 AA031913 AI683306 

AA918481 AI183314 D83907 AI206832 AA876122 D83836 D83838 D82533 AI761290 AI191125 AI143749 AW771909 
AI241436 AI767267 W56507 AA847787 AA568692 T10502 AI247870 AA715017 AA643304 AA890233 AA81 1387 
AA897470 AA907729 A170B679 AI078010 AA452830 AW419160 AI783713 N80205 W56778 AA676899 A1888718 N69930 
AI338935 AI217580 AA639508 AA575836 BE046852 AI312651 AI038406 AA628649 AA643838 AI493761 AA032024 
W38849 AA340178 AA447052 AA452969 W19369 AA296364 H44229 W58767 C05751 C05835 A1741989 N98532 
AW102617 AA412583 AI922246 W38495 AA355375 AA928571 C06275 AA352500 N93132 

1 17034 201 13.2 U72209 NM.005748 AI655607 AI052758 AA385199 AW956794 H88679 AL135153 AI765644 AA384399 AW956458 

AA568443 AA804610 AI873513 H88639 Z25371 R63456 W44919 

100752 33207.21 T81309 BE019033 R94181 BE019198 NIVL000812 J03242 AW41 1299 BE300064 BE297544 R94182 AW630108 T53723 
D58853 K78073 H80594 BE299560 T45899 H70196 M17426 N77077 S77035 H58384 H61664 H78540 T84527 C17198 
H60255 H71980 R92644 W79050 X00910 M29645 R91055 M17863 M17862 T71815 BE299561 BE464561 X06260 
R94741 T54216 C18594 BE262015 X06161 AW409889 AA378400 BE263228 BE313278 R88116 BE313457 H43500 
T48617 BE313761 H77309 AI207601 X06159 H40413 X03425 T87663 R10627 X03562 M141 18 W03982 R97520 H81229 
T83157 H83168 H48762 AA669898 BE263054 H47289 AA022807 R11555 H74260 R76968 R28338 H72534 H72464 
H62031 N72478 N45355 AW411300 R89113 R69135 H58454 T83281 R93476 H69645 H68015 T82229 H71089 T85121 
H59939 W65299 N78176 H53909 N72373 R21788 H04660 H59639 H61874 BE262219 T53614 N73335 N5Q464 W00943 
N77189 R89257 AA570502 R89432 R06366 AA553480 AA776271 AA551359 AA551050 H51670 AA601052 BE299081 
H68198 H52276 BE207832 N91 192 H70332 X07S68 X07868 H69464 H53782 H73710 R80435 M553384 AW884176 
N53475 T71662 AW954036 AW954033 AA552931 H93206 AA430218 AA553476 AI918470 T54124 BE207982 BE300177 
N73994 AW882625 N39549 N53838 AA722389 H71878 H58909 H37849 H78435 T47933 R77174 R83814 AA41 1890 
H94199 AA663208 BE205778 AA490137 H70492 R98232 H3780O AA679294 H40341 H74238 H47290 H73231 T48618 
AA025428 AI039521 H92969 N59389 H80538 H72933 T90630 AA41 1891 N55000 H74225 AA340290 AW957061 T54316 
AA340437 H57125 H58908 H79027 H63450 N74623 R93425 H68714 H68758 N68396 H48763 N69256 H57320 H53831 
H53589 N68833 N52453 H56048 H69870 H78074 R69253 R83375 T53615 H94330 H58455 H90864 T47934 H74261 
R89258 R97997 R91056 R28339 R86760 H78235 R97521 H67692 H40358 AA022688 H52513 H59601 T88690 H65256 
H63397 W65397 AA553588 R19280 N52645 W73930 R06367 R21743 H72372 N73921 AW883539 AW882639 T40616 
H47084 R95723 AA634316 AA862781 H77310 R91389 H93111 R92767 T54512 R89341 H70333 H57B17 H82941 H62032 
N52638 H58385 T91796 H51086 AA340292 T49918 H81230 R36121 N5041 1 T67664 N62436 N39340 AA665637 
AA340446 H93377 H92973 BE296290 BE269788 H61665 AA340444 N54605 AA454101 R10628 R94200 AI200549 
AA342640 BE298855 BE250229 T49916 H82008 N28278 AW880662 H71268 N76791 H47685 H65255 W05198 
AW889144 N76677 H71702 H68036 H71915 R91612 R87807 H68059 Al 1 33328 AI247866 AA621443 AW881050 
AA700847 AA340413 AW878608 AW881181 AW87B249 H71916 N54596 BE161581 AW878082 W04212 AW881040 
AW885492 AW880519 AA334887 AW878715 W06882 AW630222 AW885381 H70869 AW381778 H47601 AW889982 
H63888 AW884986 AW878713 AW878685 R36391 AW878694 AA368070 C03393 AW878695 AW878705 AW878665 
AW878742 AW878620 AW878823 AW878688 R29048 AW878690 AW878686 AW878810 AW878827 AW878733 
AW878659 AW878749 AW878681 AW883353 AW883277 AW883300 AW883565 AW883298 AW883143 AW883045 
AW883482 AW883352 AW883417 AW883357 AWB83231 AW883474 AW883355 AW882620 AW882533 AW883754 
AW883139 AW882827 AW883641 AW8835S7AW8834B1 AW882983 AW882982 AW882465AW883419 AW882466 
AW883639 AW883230 AW882981 AW882534 AW882874 AW882619 AW883480 AW882826 AW882831 AW882835 
AW882830 AW883563 AW882456 AW627642 

116417 5418 11 AW499664 AW500888 AL042095 AW576556 AW265424 AI521500 AA761333 AA761319 AW291137 AA649040 AA769094 

AA489664 AA63531 1 AW070509 AA425658 AI381489 AA609309 AA134476 W74704 AI923640 AW084888 H45700 
AI985564 AW629495 AW614573 AI859571 A1693486 AA913B92 AI806164 AA909524 AW263513 A1356361 Z40708 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



123712 


374423J 


117156 


145392J 


125008 


1802095J 


125020 


116017J 


125066 


1814993.1 


116661 


1532859J 


125104 


413347J 


124575 


1666649J 


125263 


1547.2 


131859 


3672.1 



125565 1704098.1 
132983 11922.1 



118584 532052 J 
133607 1227.6 



AI332765 AI392620 AA181060 AW118719 AW968804 AW263502 AW505314 AA036967 W74741 R51139 H19364 H45751 
Z44962 AW370823 H25650 T54007 AA453000 AL045739 

AA609684AA758732 ^ Ajeft , ojM 

W73853 AA928112 W77887 AW889237 AA148524 AI749182 AI754442 AI338392 AI253102 AI079403 AI370541 AI697341 

H97538 AW1B8021 A1927669 W72716 AI051402 AI188071 AI335900 N21488 AW770478 W92522 AI691028 AI913512 

A1144448 W73819 AA604358 N28900 W95221 AI868132 H98465 AA148793 

T91251 T64891 T85665 

T69981T69924AA078476 

T86284 T81933 

R61504 F04247 

T95590AA703278 H62764 

N68168 N69188 N90450 

AA098878 W88942 

AW960564 AA092457 T55890 D56120 T92525 A1815987 BE182608 BE182595 AW080238 M90657 AA347236 AW961686 
AW1 76446 AA304671 AW583735 T61714 AA316968 AI446615 AA343532 AA083489 AA488005 W52095 W39480 N57402 
D82638 W25540 W52847 D82729 D58990 BE619182 AA315188 AA308636 M1 12474 W76162 AA088544 H52265 
M301631 H80982 AA1 13786 BE620997 AW651691 AA343799 BE613669 BE547180 BE546656 F11933 AA376800 
AW239185 AA376086 BE544387 BE619041 AA45251 5 AA001806 AA190873 AA180483 AA159546 F00242 AI940609 
AI940602 AI189753 T97663 T661 10 AW062896 AW062910 AW062902 AI051622 AI828930 AA102452 A1685095 AI819390 
AA557597 AA383220 AI804422 AI633575 AW338147 AW603423 AW606800 AW750567 AW510672 A1250777 AA083510 
AW629109 AW513200 AA921353 AI677934 AI148698 AI955858 AA173825 AA453027 AI027865 AW375542 AA454099 
AA733014 A1591384 R79300 R80023 M843108 AA626058 AA844898 AW375550 AA889018 A1474275 AW205937 
AI052270 AW388117 AW388111 AA699452 A1242230 N47475 H38178 AA366621 AA113196 AA130023 H39740T61629 
AI885973 AW083671 AA179730 AA305757 AI285455 N83956 AA216013 AA336155 AW999959 T97525 AA345349 
T91762 AA771981 AI285092 AI591386 BE392486 BE385852 AA682601 AI682884 AA345840 T85477 AA292949 
AA932079 AA098791 D82607 T48574 AW752038 C06300 
R20840 R20839 

M30269 NMJ02508 X82245 AI078760 AW957003 D78945 M27445 AA650439 AL04881 6 AV660256 AV650347 
AA333052 BE295257 T60999 AA383049 AW369677 Z26985 AW175704 AA343326 AW747957 AIB18389 W17308 
W17302 H15591 AA371284 AA370412 W94966 BE384365 T28498 R80714 R16959 H21723 AW835154 D56097 D56381 
W21232 AA190565 AW379755 AW067895 

AW136928 AI685655 BE218584 BE465078 N68963 AA975338 BE147199 N76377 

BE273749 BE397561 BE387189 AL037858 AL037878 AI983094 BE259216 AA011363 AL0361B9 BE562325 AA251169 
BE617431 N98537 AA158093 AL047800 M34539 NM.000801 AA312140 D16971 AA158904 AA307114 AA312B03 
T09203 AW629686 AL048504 BE388578 AA220957 AA158364 BE267385 AA294971 C18055 BE241757 AA1 15056 
AI936769 BE378435 BE206971 AW674924 BE622060 AA604674 AA1 15273 AW4021 59 AA338608 BE568819 M80199 
X55741 AA3751 1 1 AA37601 6 BE612671 AA805742 AW405588 N25850 N44580 H06031 AW403549 BE536552 AA056726 
BE543239 AA082517 AI201645 AI201642 AI192622 N40104 AA370921 BE547569 AI969502 AA302038 AI197890 
AW268354 AI014938 W45448 AI541395 AA037272 BE538826 AL039613 BE536130 AA299355 AW805147 AW974624 
H53220 AI471471 AA399303 AA007386 W35106 BE613277 R12739 R12738 AA304342 AA687802 BE409581 AI498844 
AV662092 AW904105 AA011375 BE315214 H99302 BE537893 N32299 AW855829 AI291320 BE078322 A1301395 
AA303362 N32719 AA358328 AA357877 AI952540 H56279 H02758 H0204B AW805233 R82224 AA410772 AA291352 
BE171 109 N69935 BE169248 AA361173 H44978 BE617887 D52560 AA084043 W03595 R67219 N36477 N42924 R67104 
H44901 H79695 W21105 AA393988 W30899 AA316096 BE622896 W46872 AA442678 BE544893 BE540112 BE621873 
AA338067 N55052 BE398154 BE621210 AA740760 C03739 C03206 BE396692 AA482370 AA031614 AA301575 
AA304710 AA132153 AA029796 AA994960 H19567 AA442969 H49781 H46871 AA035395 AA056185 AA149378 
AA643080 AL135479 AA292329 AA654337 AA041228 AA454888 AA025039 W58331 AA625981 T94941 AA302448 
H19900 AA218956 AA51 3790 AA563962 AA398076 W44441 AA293276 W47373 AA625879 W30688 AA043029 T64284 
' R79151 AA304340 AA485186 AA604939 R82470 AA421425 AW771456 AI339329 AA304424 AA605236 AA936934 
AA587673 AI209162 AI697301 AI479995 AI679814 AI361950 AW189125 AI955888 AI986019 BE301019 AI084792 
AI310211 AW189307 AI022070 AW977204 AI146825 AW190163 AW303281 AI828345 BE046043 AW029257 AA482268 
AI246507 AI420729 AW084932 AW439514 AIB90487 AW439692 AI523896 AI186612 AI659953 AI889773 AA687527 
AW072694 AW262153 AW467371 AI613269 AI679238 D54404 AA158103 AW105527 AW149739 AW150361 AW268387 
AW1 17708 AI951682 AI687440 AW674285 AA678365 AI587082 AA732095 AA019899 W45661 AA627300 BE613304 
AA765891 AA612935 AI814658 AW316916 R66594 AA514640 M025040 AA031472 AW732076 AA029797 AI244560 
AI128734 AW381720 AI092360 AI263283 AW613175 A1890675 AI720156 AW631348 AI635106 AI278045 AA303979 
AA703505 W45449 AW078651 AI292052 AW381707 AI147854 AW381743 AA158905 AA303258 AA888144 AW195967 
AA428706 AA989559 AA617731 H19882 BE543418 AA830386 AA421302 W58652 T94995 A1869743 AI579145 
AW085971 N98425 AA765136 AI347027 AI356955 AA928038 AI679717 AA458459 AA679281 AI367973 AI270041 
AA7651 35 AA732793 AI798447 AA668646 AA251008 AI984538 AI401737 AA0561 86 BE043308 AW662375 AI3021 10 
N50724 W9S332 BE537047 N26983 AI567172 AA765296 AW673237 N29784 AA534275 AA084044 AW067973 
AW300766 T63398 W46823 R39790 AI364185 AW298582 AA454814 AW069878 N67751 H05982 N23140 AI362647 
AI302086 AI767772 N25755 H53114 AA706133 T93511 AA429291 AA935294 AA987647 W02803 R66595 AI680795 
W23673 AW440794 AA722872 H49538 AW131042 AA531603 AA908665 AA040791 AA235312 W52205 N93444 R82180 
H02759 H79696 AW088894 H56079 AA961 143 AW067776 AW973745 AA01 6311 AW071227 AA017511 AI753994 
W47374 T64155 AA296092 AI698626 AA55B158 AA296088 AW794259 H01963 AA149267 AA485076 AA975B56 H44938 
AA035396 AI955555 H46289 AA486161 AI631222 AA359047 AW794253 AI806962 AW243930 AA526145 AW878734 
AA018464 AA132031 R67220 R79152 AA296093 H54300 AI005160 BE242548 AW992803 AW878644 AW878666T27742 
R82471 AW517604 AW472738 AI282904 R39791 AA486098 AW467891 AW960520 AA551736 AA056621 AW945197 
R66373 AA554236 BE242202 AI904376 AI832590 H19484 R00890 AI627677 AA302287 AI869451 A1734855 AI708073 
AI832902 AA585184 AW204299 AA055565 D12417 D11975 T63543 AW664099 R54423 BE612712 T96340 T63985 
AA598917 T40735 T64053 AA149284 AW272548 AA363445 AA042893 AW300697 BE261973 T53501 T53500 AW878729 
AW878657 AW794391 AA069193 R01553 H44875 AA385406 AA533968 M93060 AL135600 W96331 AA017651 
AA018849 AA017692 H85337 BE278690 AA731598 AA018512 AI076813 AI022644 R02585 X52220 AW296894 M825671 
A1699321 A1393601 AW592611 AI146747 AA608921 AA158365 AW590007 AA354519 D20081 R02704 AW798339 
M92422AA094903AA007676 
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133681 13893.1 AI352558 Z82248 X78138 NM.003405 AU077248 AA223125 S80794 D78577 AI124697 AW403970 BE614Q89 BE296713 

BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 M192540 H67S36 AA321827 
AW950283 AA084159 BE538808 AW401377 AA256774 C03366 W46595 W47608 AA305009 H69431 H69456 AL120082 
H11706 AA303717 AA361357 H22042 H78020 AW999584 AA134368 AA322911 AA322961 H60980 N85248 N31547 
H79624 T11718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA608688 H82692 BE155392 AW383636 BE155394 AA4B7004 AW383504 AI342365 R82553 
W1 6498 BE155344 A1143938 R69901 AA322873 AW340648 R25364 AA367935 AI559406 AA033522 AA374252 
AW835019 A1922133 AI697089 N99662 AW18907B AI199076 AW151598 W59944 AA662875 W94022 AA299055 
AI039008 AI829449 AA583503 AI635674 AW131665 AI473820 AW2731 18 AW900930 AA908944 AI688035 AW170272 
AI082545 AW468176 AI608761 AKJ82748 AI91 1682 AI248943 AI831016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905196 AI191245 AW70204 AI188296 AI421367 AI1 25315 AI087141 AA629032 AA740589 AI554181 
AA150830 A1248541 AI077943 AA775958 AA864930 AI261476 AI123121 AI310394 AA862331 AA872478 BE537084 
AI205606 AA720684 AI872093 AW150042 AL120538 AA219627 AA988608 C21397 AI359337 H25337 AI089749 
AA605146 AI359620 AA150478 AI359738 AW3B3642 AW995424 AI766457 R56892 AI089839 W61343 N69107 W46459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA126874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 L08439 AA262704 AA50538O W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA098126 T28031 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520 AW028427 AA171496 AI469689 AW664539 Af8111Q2 AI811116 BE464590 BE350791 H78021 T15405 H21979 
AA219489 H13301 AA505883 AI864305 AI423963 AW084401 F04963 R69858 H67097 AI917740 AI655561 H69864 
AA033631 AW3B3484 AI886261 H25293 AA513281 AW271187 H11617 N79982 Al 174338 AI904207 AI904208 BE614558 
W94127 W65436 AI272249 AA700018 AI579932 AI085941 AW152629 

134403 17037J AA334551 BE008229 AA307537 AW961 156 AW995894 AW995826 NNLC06751 M81 199 AA045603 AL036372 AV645606 

AI688095 AW351901 AA101337 AA101345 N73342 BE018030 BE569044 AW841975 AA373388 BE090412 H95440 
N53845 R67867 AA093441 AA363427 H93708 AW023134 AW994986 AW994989 BE090429 R23614 AI567932 H03726 
H01 101 H01867 AA548743 AI671806 AW872949 AW872941 AA742447 AI199788 AA045604 AI637465 AI741796 
- AW242217 AW131463 AI765302 A1683923 AA889762 AI804889 AI986437 C06049 BE502340 AI695651 AI491970 
AA496B04 AA281008 AA665699 AI473814 BE301445 AA707837 AA551 925 AI017348 AI208185 AA775203 AA1 56296 
AA557453 H95441 AA768547 AW769358 AA991197 AA181954 AI091389 AI147289 AW771837 AI638582 AA844411 
AI374750 T29320 AW951 272 AW0B5923 H02834 AA843259 AA814696 AW183290 AA158453 N68125 N69039 AA100423 
AA101346 AI918720 H01102 R67B68 H01868 N66438 R46580 AI858433 AA599560 AA187577 AA157481 AA361520 
AL047827 AA158452 R21688 AW964874 AA325161 R40871 AW752395 AW375924 R13355 AA281 174 AA428908 

135096 33756_2 AA081258 AA16031 1 W17034 H83596 Z99393 AI831206 AW771 108 AW769214 N89775 AW161495 AW161522 

AW160880 Z99394 AI814820 

103767 34817.1 ' BE244667 BE241813 BE242271 AA381943 NMJ16040 AF151858 AW967497 AW966873 AI824386 AW470133 

AW015765 BE01 8650 AW503659 AI1 29838 AI632346 AA01 3099 AW77051 1 BE21 9482 AI8241 35 AI867379 AA019348 
AA285143 AW087624 AI990100 AA251084 AI633962 AA287714 AA400773 AI2921 12 AW469095 AA743312 AW1 17423 
AA694551 AA885657 AA1 12675 BE327333 AA082161 H03613 AA094735 AW500235 N28878 AA287713 AW300233 
AA826249 N46921 BE348728 AW505056 AW966879 AI521202 AA393405 AI264668 AA910851 AA251721 A1470834 
H03503 AA089688 R58562 BE004728 AA66B793 H27167 R54717 

103855 84277J W02363 N80298 AA304486 AW954799 AW805136 AW970817 AW373398 AW875459 AA136805 AA683501 N73299 

AW341082 AI632954 AA493369 AI478433 A1037911 AW272169 AW043832 AA010683 AW629090 AW183622 N64510 
AW079953 AI554533 AA563670 AA010682 AW237610 AW419057 AI470926 AI627833 AA195080 AA195179 AI471443 
AW590266 AI168477 AW771214 AI767341 AW340086 AW748455 AI280079 AI244821 AI381283 AW300130 AW183374 
AW195397 AA136706 AI824598 AW573004 Z98448 AA905255 AI497883 

AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE01 1359 
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 

AA376654 W76367 AA318232 AI694545 AI742403 AI887383 AW204731 AW874431 BE220997 AA1 14979 AA303838 
AI002267 AW952031 W74801 AA0112B7 AA115112 AI306385 R37677 AW571707 R59986 W94102 AW197042 H10206 
AW139819 AI686172 A1674165 R51633 AI357086 T23948 H10833 H23002 H11743 R37085 Z39208 H22794 H11820 
R13817 Z43122 H10257 R88398 R18795 AA010848 R67191 H10875 R67170 
AA179656 AA182626 AA182603 
R69751R70467H69771 H80879H80878 ■ 
Z41572 R39330 
AA402505AA398900 

X94703 NMJJ04249 R52316 T87420 N46403 Z36855 BE076834 
AW602528BE073859 Z38412 
AA404418AI217248 
AA321890R18000 

AI453076 AI376075 AI014836 AA628633 AA961066 AI150282 AI028574 AI217182 AA732910 AA431478 AL041229 
AA353093 AW957317 AW872498 AI560785 AI289110 AW135512 X97261 T68873 
AF008937 

AI743860 N49543 AW027759 BE349467 AI656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI611424 
AL079362 AI9B9290 AI928016 BE394912 BE504220 BE467505 AI611611 AI61 1407 AI61 1452 W56437 AI284566 
AI583349 AW1 83058 AI308085 A1074952 AA437315 AA628161 AW301728 A1150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AI432414 AI984994 AI539335 AA401550 AA358757 AI609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 AI580502 AI832944 A1038530 AA425107 AI014986 A1148349 AW237721 AW779756 
AW137877 AI125293 AA400404 R28554 
AA065069AA085108 

AA069818 AA069971 AA069923 AA069908 
AA021473 
AA608588 
AA608751 
AA128980 



126872 
113026 



120284 
112540 
111904 
121094 
128510 
114106 
121335 
120761 
122050 
130018 
100104 
121822 



108309 
107832 
123523 
123533 
132225 
125017 
125063 
125064 



142696.1 
84431J 



158963J 

1605263 1 

1719336 1 

275729J 

19829.1 

1182096J 

279548J 

224903.1 

273507.2 

18986J 

19974.-3 

244391J 



110682 J 
111495J 

genbankuAA021473 
genbanLAA60B588 
genbanLAA608751 
genbankJ\A128980 
genbank.T68875 T68875 
genbanleT85352 T85352 
genban!085373 T85373 
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125091 genbanK_T91518 T91518 

100964 entre*J00212 J00212 

102269 entre?_U30245 U30245 

125150 NOT„FOUND_entrez.W38240 W38240 

123964 genbankj)13961 C13961 

118111 genbanLN55493N55493 

118129 genbankJi57493N57493 

102491 entre2L.U51010 U51010 

1 18329 genbanK_N63520 N63520 

118475 genbankJI66845 N66845 

118581 genbanLN68905 N68905 

111514 genbank_R07998 R07998 

104534 R22303_at R22303 

120340 genbanU\A206828 AA206828 

120376 genbanKjAA227469 AA227469 

104787 genbanK_AA027317 AA027317 

120409 genbanLAA235050 AA235050 

120745 genbankJ\A302809 AA302809 

120809 genbankJ\A346495 AA346495 

120839 genbankJ\A348913 AA348913 

113702 genbanLT97307 T97307 

106596 304084 1 AI583948 AA578212 AW303715 AA653450 AA456981 AI400385 W88533 AI224133 AW272145 AA088686 R94698 

1 13947 genbanK_W84768 W84768 

122562 genbanU\A452156 AA452156 

122635 genbanleAA454085 AA454085 

108277 genbanLAA064859 AA064859 

108403 genbanK.AA075374 AA075374 

122860 genbankuAA464414 AA464414 

108427 genbanLAA076382 AA076382 

108439 genbank_AA078986 AA078986 

131353 231290 1 AW411259 H23555 AW015049 AI684275 AW015886 AW068953 AW014085 AI027260 R52686 AA918278 AI129462 

AA969360 N34869 AI948416 AA534205 AA702483 AA705292 
108533 genbankJ\A084415 AA084415 
124254 genbank_H69899 H69899 
101447 entre^M21305 M21305 
101458 entre*JM22092 M22092 

101667 13349 1 NM.005381 M60858 AW373732 AW373724 AW3736B9 AW373629 AW373609 AW373776 AA187806 AW386946 

AW374207 T05235 AA216203 AW385556 AA306940 AA306526 AA315461 AL036757 AW373711 AW4Q3124 AW403640 
• AW377084 T27360 H62638 F06957 AW377051 AA554779 AA378568 AA096007 AW352407 AW302637 F07929 H17433 
AW382712 H05665 F07292 N39875 AA089729 H62556 N42842 R12952 AW373735 AW364155 AA056183 W39185 
AW382708 N32488 AF1 14096 AW375993 AM 33569 W52561 AA603040 AA133710 AI926796 AW176370 AA827519 
AW338437 AA521142 T29341 AI800461 AW317002 AA703914 AA86083O AI859203 AI445772 AA714334 AI817066 
AI832027 AW510442 AI635802 AW0B8306 AW068672 AW408555 AW467542 AA552657 AA152367 W32081 AA5821 24 
AA074040 AA931657 A1051154 AW41 0203 AI921 644 H17434 AIB32330 AW404836 AI925038 AA088423 AA954166 
AA580453 AW021292 AI267215 AW080082 AW383778 AI933053 AI919097 W31557 N90245 M931591 AA563995 
F36352 AA056184 AA476294 AA641327 AA533550 AI749630 W58323 AA569119 AA508573 AI809050 AI378996 
AA41 1 362 AW407505 AA938104 AA074041 AA632876 AW193748 AA507873 A1270128 AI472365 AA41 1363 AI523216 
AI719965 AI816302 AA182681 AI707990 AA133588 AI758537 W60253 A1460308 AA135423 AI083904 F04188 N89693 
AW408776 A1678595 AI270568 AA722059 W58234 F33650 AA090547 AA285108 AA425981 N85079 D20218 A1273980 
AA159028 F03226 AW247914 N26918 AW272741 N90109 H05666 N23327 AW247953 R44748 AA962015 F03558 
AI752394 AW409913 AW248396 AI816463 AI752393 AA325370 AA263089 AI570130 AI971951 AI160658 AI357360 
AW168686 AL121075 AW050536 N21672 W67748 AA514242 AI127386 H14607 AI185752 W79364 AA088520 AA152476 
AW351940 AW373683 AI940524 AW374953 T56500 N24329 AI940720 AW374933 AW374947 AW391913 AL138337 
AW376241 AW062943 F26666 AW410202 AW062958 F34529 AW381807 AW393315 W17147 AW176359 AA664576 
AW380424 AA306040 AI745674 AW300951 AI188579 AI438973 AI305271 AA433818 AA612807 A1831809 AI940409 
AA158663A1572988 

124576 genbanK_N68201 N68201 

108931 genbank^AA147186 AA147186 

108941 genbankJ\A148S50 AA148650 

124720 144582J R05283R11056 

124793 genbanLR44519R44519 

124799 genbanLR450d8 R45088 

103138 entre?.X65965 X65965 

117683 genbanR_N40180N40180 

124991 genbank.T501 16 T501 16 

103432 entre*_X97748 X97748 

119174 genbanK_R71234 R71234 

119239 95573J T11483T11472 

133678 ■ 11235 1 AW247252AA346143N^000270 M381085 N91995 X00737 M381079M 

AA326750 AA376804 AW403290 T95231 M13953 T47963 HB2039 AA279899 AA627997 N76320 N99527 H37842 
W20095 AA457308 AW469547 AA724143 H83220 AA319496 W68334 W30892 R89169 R99427 N41854 H47286 
AA348094 AA045089 R63016 AI922219 AI024906 AI096488 AI885005 AA194872 N90489 AI452544 H72411 AA282427 
AA430735 R68963 R22453 H70385 AW129369 AW467320 AW519082 AA345018 AA582183 AI961789 R65918 N30611 
AI979189 AI280889 AW273191 R66531 AI285845 AI675927 AI421990 AW190879 H37794 AA699667 H68427 AA954388 
AI188757 AI140048 AA430382 AI204151 AW247B64 AA559099 AI431420 AA548276 AI149466 AA772669 AA694388 
AA724168 AA301651 AA281952 AA779925 AA234760 W86290 AA913603 AW51 1745 AI500697 AA814922 AA835040 
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T47964 H53998 AA975804 R98710 AI077604 N70252 R98084 AW250171 H69268 AI597614 AA970746 M972548 
A13771 16 R62962 H 16737 R89070 AA731329 R66532 N54354 AI818832 H81944 N71567 T95122 W86463 AA437095 
AI431999 AI915724 N63851 AI574743 AA457307 AA211475 N64444 Ar799146 H72853 R99335 H60413 AA770367 
AA156105 AI269937 H64029 H89728 R65819 AW470496 AI873318 AI735713 H82987 C02447 AM78666T27651 
AI699770 AW025156 H69719 AI984717 N69225 AI459856 AA953577 AI424691 H13843 R22404 AI873796 AI336002 
N70898 AI420854 AA541792 AA346142 A1000814 AI828348 AA045090 T51 257 N90434 H13890 N73184 AI708083 
AA781606 AA329050 AA339985 R68964 H64795 W04186 H16845 
119416 genbanK_T97186 T971B6 

NOT - .FOUND.entre^.W38194 W38194 
NOT.FOUND_entre^W38197 W38197 
119654 genbanLW57759W57759 
121350 genbanLM405237 AA405237 
121558 genbanmA412497 AA412497 
105985- genbank_AA406610 AA406610 
1 14648 ' genbanKJ\A101056 AA101056 
121895 genbanleAA427396 AA427396 
100327 enlre2LD55640 D55640 
123315 714071J AA496369 AA496646 
123473 genbankJ\A599143 M599143 . 
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TABLE 4: 



Pkey: 

Accession: 

ExAccn: 



Unique Eos probeset Identifier number 
Accession number used for previous patent filings 
Exemplar Accession number, Genbank accession number 
Unigene number 
Unigene gene title 



Pkey Accession ExAccn UniGene UnlgeneTitle 



100405 

100420 

100481 

100484 

100718 

100991 

101097 

101168 

101194 

101261 

101345 

101447 

101485 

101543 

101550 

101560 

101674 

101714 

101741 

101838 

101857 

102012 

102024 

102164 

102241 

102283 

102303 

102554 

102663 

102759 

102778 

102804 

102887 

102898 

102915 

103036 

103037 

103095 

103158 

103166 

103185 

103280 

103554 

103850 

104465 

104592 

104764 

104786 

104850 

104865 

104894 

104952 

104974 

105178 

105263 

105330 

105376 

105729 

105826 

105977 

106008 

106031 

106124 



D86425 
D86983 

HG109B-HT1098 

HG1103-HT1103 

HG3342-HT3519 

J03764 

L06797 

L15388 

L20971 

L35545 

L76380 

M21305 

M24736 

M31166 

M31551 

M32334 

M61916 

M68874 

M74719 

M92934 

M94856 

U03057 

U03877 

U18300 

U27109 

U31384 

U33053 

U59423 

U70322 

U81607 

U83463 

U89942 

X04729 

X06256 

X07820 

X54925 

X54936 

X60957 

X67235 

X67951 

X69910 

X79981 

Z18951 

AA187101 

N24990 

R81003 

AA025351 

AA027168 

AA040465 

AA045136 

AA054087 

AA071089 

AA085918 

AA187490 

AA227926 

AA234743 

AA236559 

AA292694 

AA398243 

AA406363 

AA411465 

AA412284 

AA423987 



AW291587 Hs.82733 
D86983 Hs.1 18893 
X70377 Hs.121489 
NIVL005402HS.288757 
BE295928 Hs.75424 
J03836 Hs.82085 
BE245301 Hs.89414 
NM_005308Hs.211569 
L20971 Hs.188 
D30857 Hs.62353 
NWL005795Hs,152175 
M21305 

AA296520 Hs.89546 
M31166 Hs.2050 
Y00630 Hs.75716 
AW958272 Hs.347326 
NM_002291Hs.82124 
M68874 Hs.211587 
NrVL003199Hs.326198 
BE243845 Hs.75511 
BE550723 Hs.153179 
BE259035 Hs.1 18400 
AA301867 Hs.76224 
NM_0001O7Hs.776O2 
NIvL007351Hs.268107 
AW161552 Hs.83381 
U33053 Hs.2499 
U59423 Hs.79067 
NIVL002270HS.168075 
NIVL005100HS.788 
AF000652 Hs.8180 
NM002318HS.83354 
J03836 Hs.82085 
NM_002205Hs.149609 
X07820 Hs.2258 
M13509 Hs.83169 
BE01B302 Hs.2894 
NMJ05424Hs78824 
BE242587 Hs.1 18651 
AA159248 Hs.180909 
NMJ)06825Hs.74368 
U84722 Hs.76206 
AI878826 Hs.74034 
AA187101 Hs.213194 
Z44203 Hs.26418 
AW630488 Hs.25338 
AJ039243 Hs.278585 
AA027167 Hs.10031 
AL133035 Hs.8728 
T79340 Hs.22575 
AF065214 Hs.18858 
AW076098 Hs.345588 
Y12059 Hs.278675 
AA313825 Hs.21941 
AW388633 Hs.6682 
AW338625 Hs.22120 
AW994032 Hs.8768 
H46612 Hs.293815 
AA478756 Hs.194477 
AK001972 Hs.30822 
AB033888 Hs.8619 
X64116 Hs.171844 
Hs.7567 



nldogen 2 

Melanoma associated gene 
cystafinD 

v-ral simian leukemia viral oncogene horn 
inhibitor of DNA binding 1, dominant neg 
serine (or cysteine) proteinase inhtblto 
chemokine (C-X-C motif), receptor 4 (fus 
G protein-coupled receptor kinase 5 
phosphodiesterase 4B, cAMP-specific (dun 
protein C receptor, endothelial (EPCR) 
calcitonin receptor-like 
gb:Human alpha satellite and satellite 3 
selectin E (endothelial adhesion molecul 
pentaxin-related gene, rapidly induced b 
serine (or cysteine) proteinase inhibito 
intercellular adhesion molecule 2 
taminin, beta 1 

phospholipase A2, group IVA (cytosolic, 
transcription factor 4 
connective tissue growth factor 
fatty acid binding protein 5 (psoriasis- 
singed (Drosophila)-fike (sea urchin fas 
EGF-containing fibuIin-Cke extracellula 
damage-specific DNA binding protein 2 (4 
multimerin 

guanine nucleotide binding protein 11 

protein kinase C-fike 1 

MAD (mothers against decapentaplegic, Dr 

karyopherin (importin) beta 2 

A kinase (PRKA) anchor protein (gravin) 

syndecan binding protein (syntenin) 

Tysyi oxldase-like 2 

serine (or cysteine) proteinase inhibito 
integrin, alpha 5 (fibronectin receptor, 
matrix metalloproteinase 10 (stromelysin 
matrix metalloproteinase 1 (interstitial 
placental growth, factor, vascular endoth 
tyrosine kinase with immunoglobulin and 
hematopoietically expressed homeobox 
peroxiredoxin 1 

transmembrane protein (63kD), endoplasmi 
cadherin 5, type 2, VE-cadherin (vascuta 
caveolin 1 , caveolae protein, 22kD 
hypothetical protein MGC10B95 
ESTs 

protease, serine, 23 
ESTs 

K1AA0955 protein 

hypothetical protein DKFZp434G171 
B-cell CLL/lymphoma 6, member B (zincfi 
phospholipase A2, group IVC (cytosolic 
desmoplakin(DPI, DPI)) 
bronwdomairvcontalning 4 
AD036 protein 

solute carrier family 7, (cationic amino 
ESTs 

hypothetical protein FU10849 

Homo sapiens HSPC285 mRNA, partial cds 

E3ubiqultJnllgaseSMURF2 

hypothetical protein FU1 1110 

SRY (sex determining region Y)-box 18 

Homo sapiens cDNA: FU22296 fis, clone H 

Homo sapiens cDNA: FU21952 fis, clone H 
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106155 
106302 
106423 
106793 
107174 
107216 
107295 
107385 
108756 



109001 
109166 
109456 
109766 
110107 
110906 
110984 
111006 
111018 
111133 
111760 
113073 
113195 
113923 
114521 
115061 
115096 
115145 
115819 
115947 
116314 
116339 
116430 
116589 
116733 
117023 
117186 
117563 
117997 
118475 
118581 
119073 
119155 
119174 
119221 
119416 
119866 
121335 
121381 
123160 
123473 
123523 
123533 
123964 
124006 
124315 
124659 
124669 
124847 
124875 
125091 
. 125103 
125355 
125565 
125590 
423765 
126511 
100286 
126563 
126649 
449602 
126872 
456000 
414221 
127402 



AA425309 

AA435896 

AA448238 

AA478778 

AA621714 

D51069 

T34527 

U97519 

AA127221 

AA132983 

AA135606 

AA156125 

AA179845 

AA232645 

F10399 

H16772 

N39584 

N52006 

N53375 

N54057 

N64436 

R26892 

T33637 

T57112 

W80763 

AA046808 

AA253217 

AA255991 

AA258138 

AA426573 

AA443793 

AA490588 

AA496257 

AA609717 

D59570 

F13787 

H88157 

H98988 . 

N34287 

N52090 

N66845 

N68905 

R32894 

R61715 

R71234 

R98105 

T97186 

W80814 

AA404418 

AA405747 

AA488687 

AA599143 

AA608588 

AA608751 

C13961 

D60302 

H94892 

N93521 

N95477 

R60044 

R70506 

T91518 

T95333 

R45630 

R20839 

R23858 

R23858 

AI024874 

W26247 

W26247 

AA856990 

AA856990 

AA136653 

AA136653 

AA136653 

AA358869 



AA425414 Hs.33287 
AA398859 Hs.18397 
AB020722 Hs.16714 
H94997 Hs.16450 
BE1 22762 Hs.25338 
D51069 Hs.211579 
AA186629 Hs.80120 
NMJ05397HS.16426 
AA127221 Hs.1 17037 
AL1 17452 Hs.44155 
AA135606 Hs.189384 
AI056548 Hs.72116 
AA219691 Hs.73625 
AW956580 Hs.42699 
F06838 Hs.14763 
AW151660 Hs.31444 
AA035211 Hs.17404 
AW613287 Hs,80120 
BE387014 Hs.166146 
AI287912 Hs.3628 
AW580939 Hs.97199 
BE551929 Hs.268754 
N39342 Hs.103042 
H83265 Hs.8881 
AW953484 Hs.3849 
AW139036 Hs.108957 
AI751438 Hs.41271 
AI683069 Hs.175319 
AA740907 Hs.88297 
AA486620 Hs.41135 
R47479 Hs.94761 
AI799104 Hs.178705 
AK000290 Hs.44033 
AK001531 Hs.66048 
AI557212 Hs.17132 
AL157424 Hs.61289 
AW070211 Hs.102415 
H98988 Hs.42612 
AF055634 Hs.44553 
N52090 Hs.47420 
N66845 
N68905 

BE245360 Hs.279477 
R61715 Hs.310598 
R71234 

C14322 Hs.250700 
T97186 

AA496205 Hs.193700 
AA404418 

AW088642 Hs.97984 
AA488687 Hs.284235 
AA599143 
AA608588 
AA608751 
C13961 
AI147155 



Hs.270016 



NH.005402HS.288757 
AI680737 Hs.289068 



Hs.102943 
Hs.304177 
Hs.285814 



AI571594 
W07701 
AI887664 
T91518 
AA570056 Hs.122730 



Hs.170098 

Hs.143375 
Hs.143375 
Hs.57958 



R60547 
R20840 
R23858 
R23858 
T92143 
BE247550 Hs.86859 
AA516391 Hs.181368 
AA001860 Hs.279531 
AA001860 Hs.279531 
AW450979 
BE180876 Hs.11614 
AW450979 

AA358869 Hs.227949 



nuclear factor l/B 

hypothetical protein FU23221 

Rho guanine exchange factor (GEF) 15 

ESTs 

ESTs 

melanoma ceil adhesion molecule 

UDP-N-acetyl-alpha-D-galactosaminerpolyp 

podocalyxin-iike 

ESTs 

DKFZP586G1517 protein 

gb:z!10a05.s1 Soares_pregnanLuterus_NbH 

hypothetical protein FU20992 similar to 

RAB6 interacting, kinesin-like (rabkines 

ESTs 

ESTs 

ESTs 

ESTs 

UDP-N-acetyI-alpha-D-galactosamine:polyp 
Homer, neuronal immediate early gene, 3 
mitogen-activated protein kinase kinase 
complement component C1q receptor 
Homo sapiens cDNA FU11949 fis, clone HE 
microtubule-associated protein 1B 
ESTs, Weakly similar to S41044 chromosom 
hypothetical protein FU22041 similar to 
40S ribosomal protein S27 isoform 
Homo sapiens mRNA fuii length insert cDN 
ESTs 
ESTs 

endomudn-2 
KIAA1691 protein 

Homo sapiens cDNA FU1 1333 fis, clone PL 

dipeptidyl peptidase 8 

hypothetical protein FU10669 

ESTs, Moderately simnarto I54374 gene 

synaptojanin 2 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs. Weakly simflar to ALU1 JWMAN ALU S 
unc5 (C.elegans homolog) c 
EST 

gb:za46d 1 .s1 Soares fetal liver spieen 
gb:za69b09.s1 Soares JetalJung_NbHL19W 
ESTs 

ESTs, Moderately similar to ALULHUMAN A 
gb:yi54c08.s1 Soares placenta Nb2HP Homo 
tryptase beta 1 

gb:ye50h09.s1 Soares fetal liver spleen 
Homo sapiens mRNA; cDNA DKFZp586l0324 {f 
gb:zw37e02.s1 SoaresJotal_fetus_Nb2HF8._ 
hypothetical protein FU22252 similar to 
ESTs, Weakly similarto I38022 hypotheti 
gb:ae52d04.s1 Stratagene lung carcinoma 
gb:ae54e06.s1 Stratagene lung carcinoma 
gbae56h07.s1 Stratagene lung carcinoma 
gb:C13961 Clontech human aorta polyA+mR 
ESTs 

v-ral simian leukemia viral oncogene horn 

Homo sapiens cDNA FU1 1918 fis, clone HE 

hypothetical protein MGC12916 

Homo sapiens clone FLB8503 PR02286 mRNA, 

sprouty (Drosophila) homolog 4 

gb:ye20fD5.s1 Stratagene lung (937210) H 

ESTs, Moderately similar to KIAA1215 pro 

KIAA0372gene product 

gb:yg05c08.r1 Soares infant brain 1NIB H 

Homo sapiens, clone IMAGE:3840937, mRNA. 

Homo sapiens, clone 1MAGE:3840937, mRNA, 

EGF-TM7-latrophffin-related protein 

growth factor receptor-bound protein 7 

U5 snRNP-specific protein (220 kD), orth 

ESTs 

ESTs 

gb:UI-H-BISaIaha-12^4Jl.s1 NCLCGAP.Su 
HSPC065 protein 

gb:Ul-H-BI3-ala-a-12-0-Ul.s1 NCLCGAP_Su 
SEC13{S. cerevisiae)-l3ce1 
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127651 
424806 
128062 
128992 
129046 
129188 
129314 
129371 
129468 
129765 
129805 
129884 
130495 
130639 
130657 
130828 
130972 
131080 
131137 
131182 
131486 
131573 
131647 
131756 
131859 
131B81 
132050 
132083 
132164 
132358 
132413 
132456 
132490 
132676 
132687 
132718 
132736 
132760 
132933 
132968 
132994 
133061 
133147 
133161 
133200 
133260 
133363 
133491 
133517 
133550 
133607 
133614 
133627 
133691 
133696 
133913 
133975 
133985 
134039 
134088 
134161 
134299 
134416 
116470 
134656 
134989 
135051 
135073 
135349 
100114 
100130 
100143 
100168 
100208 
100224 
100405 



AI123976 

AI123976 

AA379500 

R49693 

AA195678 

M30257 

AA028131 

M10321 

J03040 

M86933 

AA012933 

AA286710 

AA243278 

D59711 

T94452 

AA053400 

AA370302 

J05008 

U85193 

AA256153 

X83107 

AA046593 

AA410480 

D45304 

M90657 

AA010163 

AA136353 

Y07867 

U84573 

X60486 

AA132969 

AA1 14250 

F13782 

AA283035 

AB002301 

AA056731 

U68019 

H99198 

AA598702 

N77151 

AA505133 

AB000584 

D12763 

AA253193 

AA432248 

AA083572 

AA479713 

L40395 

X52947 

W80846 

M34539 

D67029 

U09587 

M85289 

D10522 

W84712 

D29992 

134657 

S78569 

D43636 

U97188 

AA487558 

M28882 

X70683 

X14787 

AA236324 

C15324 

AA452000 

D83174 

D 00596 

D11428 

D13640 

D14874 

D26129 

D28476 

D86425 



AA382523 Hs.105689 
AA382523 Hs.105689 
AA379621 Hs.105547 
H04150 H3.107708 
AB029290 Hs.108258 
NM.001078HS.109225 
BE622768 Hs.290356 
X06828 Hs.1 10802 
AW410538 Hs.1 11779 
M86933 Hs.1238 
AA012848 Hs.12570 
AF055581 Hs.13131 
AW250380 Hs.109059 
AI557212 Hs.17132 
AW337575 Hs.201591 
AW631469 Hs.203213 
D81866 Hs.21739 
NM_001955Hs.2271 
W27392 Hs.33287 



AI824144 
F06972 



Hs.23912 
Hs.27372 



AA040311 Hs.28959 
AA359615 Hs.30089 
AA443966 Hs.31595 
AW960564 
AW361018 Hs.3383 
AI267615 Hs.38022 
BE386490 Hs.279663 
AI752235 Hs.41270 
NM_003542Hs.46423 
AW361383 Hs.260116 
AB011084 Hs.48924 
NM 001290HS.4980 
N92589 Hs.261038 
AB002301 Hs.54985 
NM_004600Hs.554 
AW081883 Hs.211578 
AA125985 Hs.56145 
BE263252 Hs.6101 
AF234532 Hs.61638 
AA1 12748 Hs.279905 
A1186431 Hs.296638 
AA026533 Hs.66 
AW021103 Hs.6631 
AB037715 Hs.183639 
AA403045 Hs.6906 
AI866286 Hs.71962 
BE619053 Hs.170001 
N1VL000165HS.74471 
AI129903 Hs.74669 
BE273749 

NWL003003HS.75232 
NNL002047HS.75280 
M85289 Hs.211573 
A1878921 Hs.75607 
AU076964 Hs.7753 
C18356 Hs.295944 
L34657 Hs.78146 
NWL002290HS.78672 
AI379954 Hs.79025 
AA634543 Hs.79440 
AW580939 Hs.97199 
X68264 Hs.211579 
AI272141 Hs.83484 
AI750878 Hs.87409 
AW968058 Hs.92381 
AI272141 Hs.83484 
W55956 Hs.94030 
AA114212 Hs.9930 
X02308 Hs.82962 
NM_000304Hs.103724 
AU076465 Hs.278441 
H73444 Hs.394 
NM.002933HS.78224 
AL121516 Hs.138617 
AW291587 Hs.82733 



MSTP031 protein 
MSTP031 protein 

neural proliferation, differentiation an 
ESTs 

actin binding protein; macrophtn (microf 
vascular ceil adhesion molecule 1 
mesoderm development candidate 1 
von WQIebrand factor 
secreted protein, acidic cysteine-rich 
amelogenln (Y chromosome) 
tubulln-spectfic chaperons d 
lysosomal 

mitochondrial ribosomal protein L12 
ESTs, Moderately similar to I54374 gene 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586l1518 (f 
endothelin 1 
nuclear factor l/B 
ESTs 

BMX non-receptor tyrosine kinase 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamily member 1 



ESTs 
Pirin 

procollagen-lysine, 2-oxoglutarate 5-dfo 
H4 histone family, member G 
metalloprotease 1 (pitriiysin family) 
KIAA0512 gene product; ALEX2 
LIM domain binding 2 
ESTs, Weakly similar to 138022 hypotheti 
KIAA0303 protein 

Sjogren syndrome antigen A2 (60kD, ribon 
Homo sapiens cDNA: FU23037 fis, clone L 
thymosin, beta, identified in neuroblast 
hypothetical protein MGC3178 
myosin X 

clone HQ0310PRO0310p1 
prostate differentiation factor 
interteukin 1 receptor-like 1 
hypothetical protein FU20373 
hypothetical protein FU10210 
Homo sapiens cDNA: FU23197 fis, clone R 
ESTs, Weakly similar to B36298 proRne-r 
eukaryotic translation initiation factor 
gap junction protein, alpha 1 , 43kD (con 
vesicle-associated membrane protein 5 (m 
FK506-binding protein 1A (12kD) 
SEC14 (S. cerevisiae)-like 1 
glycyl-tRNA synthetase 
heparan sulfate proteoglycan 2 (pertecan 
myristoylated alanlne-rich protein kinas 
calumenin 

tissue factor pathway inhibitor 2 
piatelet/endothelial cell adhesion molec 
laminin, alpha 4 
KIAA0096 protein 
IGF-II mRNA-binding protein 3 
complement component C1q receptor 
melanoma ceil adhesion molecule 
SRY (sex determining region Y)-box 4 
thrombospondin 1 

nudix (nucleoside diphosphate linked moi 

SRY (sex determining region Y)-box 4 

Homo sapiens mRNA; cONA DKFZp586E1624 (f 

serine (or cysteine) proteinase inhibito 

thymidyiate synthetase 

peripheral myelin protein 22**' 

K1AA0015 gene product 

adrenomedullin 

ribonuclease, RNase A family, 1 (pancrea 
thyroid hormone receptor interactor 12 
nidogen2 
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100420 
100455 
100529 
100618 
100619 
100658 
100676 
100718 
100752 
100828 
100850 
100991 
101097 
101110 
101142 
101156 
101168 
101184 
101192 
101317 
101336 
101345 
101400 
101475 
101485 
101496 
101505 
101543 
101557 
101560 
101587 
101592 
101633 
101634 
101667 
101682 
101714 
101720 
101741 
101744 
101793 
101837 
101838 
.101840 
101857 
101864 
101931 
101966 
102012 
102013 
102024 
. 102059 
102121 
102283 
102300 
102378 
102395 
102460 
102491 
102499 
102523 
102560 
102564 
102589 
102600 
102645 
102687 



D86983 
D87953 

HG1862-HT1897 
HG2614-HT2710 
HG2639-HT2735 
HG2855-HT2995 
HG3044-HT3742 
HG3342-HT3519 
HG3543-HT3739 
HG4069-HT4339 
HG417-HT417 
J03764 
L06797 
L08246 
L12711 
L13977 
L15388 
L19871 
L20859 
L42176 
L49169 
L76380 
M15990 
M23254 
M24736 
M26576 
M27396 
M31166 
M31994 
M32334 
M35878 
M36429 
M57730 
M57731 
M60858 
M62994 
M68874 
M69043 
M74719 
M75126 
M84349 
M92843 
M92934 
M93056 
M94856 
M95787 



Hs.1 18893 
AW888941 Hs.75789 
BE313693 Hs.334330 



AI752163 
N24433 
U56725 
X02761 



102709 
102759 
102804 
102882 
102907 
102915 
102927 
102960 



S81914 

U03057 

U03100 

U03877 

U08021 

U14391 • 

U31384 

U32944 

U40369 

U41767 

U46959 

U51010 

U51478 

U53445 

U59289 

U59423 

U62015 

U63825 

U67963 

U73379 

U73824 

U77604 

U81607 

U89942 

X04412 

X06985 

X07820 

X12876 

X15729 



Hs.1 14599 
Hs.241557 
Hs.180414 
Hs.287820 
Hs.75424 

T81309 

AL048753 Hs.303649 
AA836472 Hs.297939 
J03836 Hs.82085 
BE245301 Hs.89414 
AI439011 Hs.86386 
L12711 Hs.89643 
AA340987 Hs.75693 
NMJX)5308Hs.211569 
NNLQ01674HS.460 
BE247295 Hs.78452 
L42176 Hs.8302 
NMJJQ6732HS.75678 
NM_005795Hs.152175 
M15990 Hs.194148 
BE410405 Hs.76288 
AA296520 Hs.89546 
X12784 Hs.1 19129 
AA307680 Hs.75692 
M31166 Hs.2050 
BE293116 Hs.76392 
AW958272 Hs.347326 
AI752416 Hs.77326 
AF064853 Hs.91299 
NM_004428Hs.1624 
AV650262 Hs.75765 
NMJ)05381 
AF043045 Hs.81008 
M6B874 Hs.211587 
M69043 Hs.81328 
NM_003199Hs.326198 
AI879352 Hs.1 18625 
W01076 Hs.276573 
M92843 Hs.343586 
BE243845 HsJ5511 
AA236291 Hs.183583 
BE550723 Hs.153179 
BE392588 Hs.75777 
NR4_006823Hs.75209 
X96438 Hs.76095 
BE259035 Hs.1 18400 
BE616287 Hs.178452 
AA301867 Hs76224 
AI752666 Hs.76669 
NM_004998Hs.82251 
AW161552 Hs.83381 
AI929721 Hs.5120 
AU076887 Hs.28491 
AU077005 Hs.92208 
U48959 Hs.211582 
U51010 

BE243877 Hs.76941 
U53445 Hs.15432 
R97457 Hs.63984 
U59423 Hs.79067 
AU076728 Hs.8867 
AI984144 Hs.66713 
AL1 19566 Hs.6721 
NM_007019Hs.93002 
AA532780 Hs.183684 
AA122237 Hs.81874 
NM_005100Hs.788 
. NM_002318Hs.83354 
AI767736 Hs.290070 
BE409861 Hs.202833 
XO7820 Hs.2258 
BE512730 Hs.65114 
AI9Q4738 Hs.76053 



Melanoma associated gene 
N-myc downstream regulated 
calmodulin 2 (phosphorytase kinase, delt 
collagen, type VIII, alpha 1 
RNA binding motif, single stranded Inter 
heat shock 70kD protein 2 
fibronectin 1 

inhibitor of DNA binding 1, dominant neg 
insulln-like growth factor 2 (somatomedi 
small Inducible cytokine A2 (monocyte ch 
cathepsin B 

serine (or cysteine) proteinase mhibito 

chemoklne (C-X-C motif), receptor 4 (fiis 

myeloid cell leukemia sequence 1 (BCL2-T 

transketolase (Wernicke-Korsakoff syndro 

prolylcarboxypeptidase (angiotensin ase C 

G protein-coupled receptor kinase 5 

activating transcription factor 3 

solute carrier family 20 (phosphate tran 

four and a half LIM domains 2 

FBJ murine osteosarcoma viral oncogene h 

calcitonin receptor-tike 

v-yes-1 Yamaguchi sarcoma viral oncogene 

calpain 2, (m/ll) targe subunit 

selectin E (endothelial adhesion molecul 

collagen, type IV, alpha 1 



pentaxin-related gene, rapidly induced b 
aldehyde dehydrogenase 1 family, member 
intercellular adhesion molecule 2 
insulin-like growth factor binding prote 
guanine nucleotide binding protein (G pr 
ephrin-A1 
GR02 oncogene 
nucleolin 

filamln B, beta (actin-binding protein-2 
phospholipase A2, group IVA (cytosolic, 
nuclear factor of kappa light po lypeptid 
transcription factor 4 
hexokinase 1 

CD59 antigen p18-20 (antigen Identified 
zinc finger protein homologous to Zfp-36 
connective tissue growth factor 
serine (or cysteine) proteinase inhffaito 
fatty acid binding protein 5 (psoriasis- 
transgelin 

protein kinase (cAMP-dependent, catafyti 
Immediate eariy response 3 



catenin (cadherin-associated protein), a 
EGF-containing fibulin-Qke extracellula 
nicotinamide N-methyltransferase 
myosin IE 

guanine nucleotide binding protein 1 1 
dynein, cytoplasmic light polypeptide 



a disintegrin and metalloproteinase doma 
myosin, light polypeptide kinase 
gb:Human nicotinamide N-methyltrartsferas 
ATPase, Na-tfK+ transporting, beta 3 poly 
downregulated in ovarian cancer 1 
cadherin 13, H-cadherin (heart) 
MAO (mothers against decapentapiegic, Dr 
cystelne-rich, angiogenic inducer, 61 
hepatitis delta antigen-interacting prot 



ubiqurtin carrier protein E2-C 

eukaryotic translation initiation factor 

microsomal glutathione S-transferase 2 

A kinase (PRKA) anchor protein (gravin) 

tysyloxidase-Rke2 

gelsoGn (amyloidosis, Finnish type) 

heme oxygenase (decycling) 1 

matrix metalloproteinase 10 (strometysin 

keratin 18 

DEAD/H (Asp-Glu-AIa-Asp/His) box polypep 
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103011 
103020 
103029 
103036 
103056 
103080 
103095 
10313A 
103176 
103195 
103347 
103371 
103432 
103471 
103967 
104447 
104764 
104783 
104798 
104865 
104877 
104894 
104952 
105113 
105178 
105196 
105215 
105263 
105271 
105330 
105461 
105492 
105493 
105594 
105727 
105732 
105767 
105882 
105936 
106031 
106124 
106222 
106241 
106263 
106264 
106366 
106454 
106634 
106724 
106793 
106799 
106842 
106868 
10S890 
106961 
106974 
107030 
107061 
107086 
107216 
107385 
107444 
107985 
108507 
108695 
108931 
109001 
109195 
109390 
109456 
109737 
110411 
110660 
110906 
111018 
111091 



X52541 
X53416 
X54489 
X54925 
X57206 
X59798 
X60957 



X69111 

X70940 

X87838 

X91247 

X97748 

Y00815 

AA303711 

L44538 

M025351 

AA027050 

AA029462 

AA045136 

AA047437 

AA054087 

AA071089 

AA1 56450 

AA187490 

AA195031 

AA205724 

AA227926 

AA227986 

AA234743 

AA253216 

AA256210 

AA25626B 

AA279397 

AA292379 

AA292717 

AA346551 

AA400292 

M404336 

AA412284 

AA423987 

AA428594 

AA430108 

AA431462 

M431470 

AA443756 

AA449479 

AA459916 

AA465226 

M478778 

AA479037 

AA482597 

AA487561 

AA489245 

AA504110 

AA520989 

AA599434 

AA608649 

AA609519 

D51069 

U97519 

W28391 

AA035638 

AA083514 

AA121315 

AA147186 

AA156125 

AA188932 

AA219653 

AA232645 

F10078 

H48032 

H82117 

N39584 

N54067 

N59858 



AJ243425 Hs.326035 
X53416 Hs.195464 
AW800726 Hs.789 
M13509 Hs.83169 
Y18024 HsJ8877 
AU077231 Hs.82932 
NM_005424Hs.78824 



AL021154 Hs.76884 
AA351647 Hs.2642 
AU077309 Hs.171271 
X91247 Hs.13046 
X97748 

Y00815 Hs.75216 
AL120051 Hs.144700 
AW204145 Hs.156044 
AI039243 Hs.278585 
AA533513 Hs.93659 
AW952619 Hs.17235 
T79340 Hs.22575 
AI138635 Hs.22968 
AF065214 Hs.18858 
AW076098 Hs.345588 
AB037816 Hs.8982 
AA313825 Hs.21941 
W84893 Hs.9305 
AA205759 Hs.10119 
AW388633 Hs.6682 
AA807881 Hs.25329 
AW338625 Hs.22120 
BE539071 Hs.69388 



A1805717 
AL047586 



Hs.289112 
Hs.10283 



AB024334 Hs.25001 
AL135159 Hs.20340 
AW504170 Hs.274344 
AW370946 Hs.23457 
W46802 Hs.81988 



AI678765 

X64116 

H93366 



Hs.21812 

Hs.171844 

Hs.7567 



AA356392 Hs.21321 

BE019681 Hs.6019 

W21493 Hs.28329 

AL046859 Hs.3407 

AA186715 Hs.336429 
NM_014036Hs.5216 

W25491 Hs.288909 



N48670 
H94997 



Hs.28631 
Hs.16450 



BE313412 Hs.7961 
AF124251 Hs.26054 
BE185536 Hs.301183 
AA489245 Hs.88500 
AW243614 Hs.18063 
AI817130 Hs.9195 
AL1 17424 Hs.25035 
BE147611 Hs.6354 
NIVL012331Hs^6458 
D51069 Hs.211579 
NM.005397HS.16426 
W28391 Hs.343258 
T40064 Hs.71968 
AI554545 Hs.68301 
AB029000 Hs.70823 
AA147186 

AI056548 Hs.72116 
AF047033 Hs.132904 
AW007485 Hs.87125 
AW956580 Hs.42699 
AA055415 Hs.13233 
AW001579 Hs.9645 
AA782114 HS.28043 
AA035211 Hs.17404 
AI287912 Hs.3628 
AA300067 Hs.33032 



early growth response 1 

fHarnin A, alpha (actm-binding protein- 

GR01 oncogene (melanoma growth stimufatl 

matrix metalloproteinase 1 (interstitial 

inositol 1,4,5-trisphosphate 3-kinase B 

cyclin D1 (PRAD1: parathyroid adenomatos 

tyrosine kinase with immunoglobulin and 

gb:Rsaptens SOD-2 gene for manganese su 

inhibitor of DNA binding 3, dominant neg 

eukaryotlc translation elongation factor 

catenin (cadherin-associated protein), b 

thioredoxin reductase 1 

gb:H.sapiens PTX3 gene promoter region. 

protein tyrosine phosphatase, receptor t 

ephrin-81 

ESTs 

ESTs 

protein disulfide isomerase related prot 

Homo sapiens clone TCCCIA00176 mRNA sequ 

B-cefl CLL/Iymphoma 6, member B (zinc fi 

Homo sapiens done IMAGE:451939, mRNA se 

phospholipase A2, group IVC (cytosoHc, 

desmoplaidn (DPI, DPII) 

Homo sapiens, clone IMAGE:3506202, mRNA, 

AD036 protein 

angiotensin receptor-like 1 

hypothetical protein FU14957 

solute carrier family 7, (cationic amino 

ESTs 

ESTs 

hypothetical protein FLJ20505 

CGi-43 protein 

RNA binding motif protein 8B 

tyrosine 3-monooxygenase/tryptophan 5-mo 

KIAA1002 protein 

hypothetical protein MGC12942 

ESTs 

disabled (Drosophila) homolog 2 (mitogen 
ESTs 

Homo sapiens cONA: FU22296 fis, clone H 

Homo sapiens cONA: FU21962 fis, done H 

Homo sapiens done FLB9213 PR02474 mRNA, 

Homo sapiens cDNA: FU21288 fis, done C 

hypothetical protein FLJ 14005 

protein kinase (cAMP-dependent, cataiyti 

RIKEN cDNA 9130422N19 gene 

HSPC028 protein 

hypothetical protein RJ22471 

Homo sapiens cDNA: FU22141 fis, clone H 

ESTs 

Homo sapiens done 25012 mRNA sequence 
novel SH2-containing protein 3 
molecule possessing ankyrin repeats indu 
mitogen-activated protein kinase 8 inter 
Homo sapiens cDNA FU 10768 fis, clone NT 
Homo sapiens cDNA FU13698 fis, clone PL 
chloride intracellular channel 4 
stromal cell derived factor receptor 1 
methionine sulfoxide reductase A 
melanoma ceil adhesion molecule 
podocalyxin-like 

proliferation-associated 2G4, 38kD 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 

ESTs 

KIAA1077 protein 

gb:zo38d01.s1 Stratagene endothelial eel 
hypothetical protein FU20992 similar to 
solute carrier family 4, sodium blcarbon 
EH-domain containing 3 
ESTs 

ESTs, Moderately similar to A47582 B-cel 
Homo sapiens mRNA for KIAA1741 protein, 
ESTs 
ESTs 

mitogen-acbVated protein kinase kinase 
hypothetical protein DKFZp434N185 
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111356 N90933 
111378 N93764 
111741 R26124 
111769 R27957 
112318 R55470 
112951 T16550 
113057 T26674 
113195 T57112 
113490 T88700 
113542 T90527 
113803 W42789 
113847 W60002 
113910 W78175 
113947 W84768 
114047 W94427 
115061 AA253217 
115819 AA426573 
115870 AA432374 
115964 AA446622 
116228 AA478771 
116264 AA482594 
116314 AM90588 
116589 D59570 
117023 H88157 
117112 H94648 
117156 H97538 
117176 H98670 
117280 N22107 
119559 W38197 
119866 W80814 
120655 AA287347 
121314 AA402799 
121335 AA404418 
121822 AA425107 
121835 AA425435 
122331 AA442872 
122577 AA452860 
123160 AA488687 
123486 AA599674 
124059 F13673 
124339 H99093 
124358 N22495 
124364 N23031 • 
124726 R15740 
124763 R39610 
125167 W45560 
125304 Z39833 
125307 Z40583 
125329 AA825437 
107985 R66613 
125598 R66613 
125609 AA868063 
116024 AA128075 
418000 AA128075 
126399 AA12B075 
127435 N66570 
127566 AI051390 
127619 AA627122 
434190 AA627122 
128453 X02761 
128495 AF010193 
128515 AA1 49044 
128580 U82108 
128623 D78676 
128642 L35240 
128669 AA598737 
128903 R69417 
128914 AA232837 
129087 N72695 
129188 M30257 
129226 M96843 
129265 X68277 
129345 AA292440 
129468 J03040 
129488 AA228107 
101838 AA449789 



BE301871 Hs.4867 
AW160993 Hs.326292 
AB020653 Hs.24024 
AW629414 Hs.24230 
AW083384 Hs.11067 
AA307634 Hs.6650 
AW194301 Hs.339283 
H83265 Hs.8881 
BE178110 Hs.173374 
H43374 Hs.7890 
AW880709 Hs.283683 
NM_Q05032Hs.4114 
AA113262 Hs.17901 
W84768 

AL035858 Hs.3807 
AI751438 Hs.41271 
AA486620 Hs.41135 
NIVL005985HS.48029 
AA987568 Hs74313 



Hs.50841 
Hs.272239 
Hs.178705 
Hs.17132 



AI767947 
D51174 
AI799104 
AI557212 
AW070211 Hs.102415 
AW969999 Hs.293658 
W73853 

H45100 Hs.49753 
M18217 Hs.172129 
W38197 

AA496205 Hs.193700 
AA305599 Hs.238205 
W07343 Hs.182538 
AA404418 
AI743860 

AB033030 Hs.300670 
AL133437 Hs.110771 
AA829725 Hs.334437 
AA488687 Hs.284235 
BE019072 Hs.334802 
BE387335 Hs.283713 
H99093 Hs.343411 
AW070211 Hs.102415 
AF265555 Hs.250646 
NIVL003654HS.104576 
BE410405 Hs.76288 
AL137540 Hs.102541 
AL359573 Hs.124940 
AW580945 Hs.330466 
AA825437 Hs.58875 
T40064 Hs.71968 
T40064 HS71968 
AA868063 Hs.104576 
AA088767 Hs.83883 
AA932794 Hs.83147 
AA088767 Hs.83883 
X69086 Hs.286161 
AI051390 Hs.1 16731 
' AA627122 Hs.163787 
AA627122 Hs.163787 
X02761 Hs.287820 
NM_005904Hs.100602 
BE395085 Hs.10086 
U82108 Hs.101813 
BE076608 Hs.105509 
Z28913 Hs.102948 
W28493 Hs.180414 
AW150717 Hs.345728 
AW867491 Hs.107125 
AI348027 Hs.108557 
NMJ01078Hs.109225 
BE222494 Hs.180919 
AA530892 Hs.171695 
R22497 Hs.1 10571 
AW410538 Hs.111779 
AW966728 Hs.54642 
BE243845 Hs.75511 



mannosyl (alpha-1 ,3-)-gtycoprotefn beta- 
hypothetical gene DKFZp434A1114 
KIAA0846 protein 
ESTs 

ESTs, Highly similar to T46395 hypothetl 
vacuolar protein sorting 45B (yeast homo 
Human DNA sequence from clone RP1-187J11 
ESTs, Weakly similar to S41044 chromosom 
Homo sapiens cDNA FU10500 fls, clone NT 
Homo sapiens mRNA for KIAA1671 protein, 
chromosome 8 open reading frame 4 
plastin 3 (T isofprm) 

Homo sapiens, clone IMAGB3937015, mRNA, 
gb:zh53d03,s1 $oaresJetaLliver_spleen_ 
FXYD domain-containing ion transport reg 
Homo sapiens mRNA full length insert cDN 
endomucin-2 

snail 1 (drosophiia homolog), zinc finge 

K1AA1265 protein 

ESTs 

lysosomal 

Homo sapiens cDNA FLJ1 1333 fis, clone PL 
ESTs, Moderately similar to I54374 gene 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs 
ESTs 

uveal autoantigen with coiled coil domai 
Homo sapiens cDNA: FU21409 fis, clone C 
Empirically selected from AFFX single pr 
Homo sapiens mRNA; cDNA DKFZp586l0324 (f 
hypothetical protein PRO2013 
phospholipid scramblase 4 
gb:zw37e02.s1 Scares Jota1_fetus_Nb2HF8_ 
metailothionein 1E (functional) 
KIAA1204 protein 

Homo sapiens cDNA: FU21904 fis, clone H 
hypothetical protein M6C4248 
ESTs, Weakly similar to I38022 hypothetl 
Homo sapiens cDNA FU14680 fis, clone NT 
ESTs, Weakly simitar to S64054 hypothetl 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
baculoviral IAP repeat-containing 6 
carbohydrate (keratan sulfate GaI-6) sul 
calpain 2, (m/II) large suburb 
netrin4 

GTP-binding protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
Homo sapiens mRNA; cDNA DKF2p564F053 (fr 
carbohydrate (keratan sulfate Gal-6) sul 
transmembrane, prostate androgen induced 
guanine nucleotide binding protein-like 
transmembrane, prostate androgen induced 
Homo sapiens cDNA FU13613 fis, clone PL 
ESTs 
ESTs 
ESTs 

fibronectln 1 

MAD (mothers against decapentapiegic, Dr 
type I transmembrane protein Fn14 
solute carrier family 9 (sodium/hydrogen 
CTL2gene 

enigma (UM domain protein) 
heat shock 70kO protein 8 
STAT induced STAT inhibitor 3 
plasmalemma vesicle associated protein 
hypothetical protein PP1057 
vascular cell adhesion molecule 1 
inhibitor of DNA binding 2, dominant neg 
dual specificity phosphatase 1 
growth arrest and DNA-damage-inducible, 
secreted protein, acidic, cysteine-rich 
methionine adenosyltransferase II, beta 
connective tissue growth factor 
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413731 AA449789 
129557 W01367 
129619 AA610116 
129627 AA258308 
129762 AA460273 
129884 AA286710 
130018 T68873 
130147 D63476 
130178 M62403 
130282 X55740 
130431 L10284 
130495 AA243276 
130553 M430032 

130638 H16402 

130639 D59711 
130657 T94452 
130686 AA431571 
130776 R79356 
130818 AA280375 
130840 Z49269 
130899 Z41740 
131002 AA121543 
131080 J05008 
131084 AA101878 
131091 T35341 
131107 N87590 
131182 AA256153 
131237 W74533 
131319 U25997 
131328 V01512 
131509 X56681 
131555 AA161292 
131564 AA491465 
131573 AA046593 
131692 D50914 
131756 D45304 
131859 M90657 
131909 W69127 
131915 AA316186 
132046 AA384503 
132050 AA136353 
132151 AA044755 
132164 U84573 
132187 AA058911 
132303 AA620962 
132314 AA285290 
132358 X60486 
132398 R31641 
132421 AA489190 
132490 F13782 
132520 AA257993 
132546 M24283 
132610 AA443114 
132716 T35289 
132840 N23817 
132883 AA047151 
132968 N77151 
132989 AA480074 
132999 Y00787 
133071 T99789 
133076 W84341 
133099 L09209 
133147 D12763 
133149 T16484 
133161 AA253193 
133200 AA432248 
133220 X82200 
133260 AA083572 
133295 L00352 
133349 N75791 
133391 X57579 
133398 X02612 
133436 H44631 
133454 AA090257 
133478 X83703 
133491 L40395 



BE243845 Hs.75511 
AL045404 Hs.46366 
AA209534 Hs.284243 
T40064 Hs.71968 
AA453694 Hs.12372 
AF055581 Hs.13131 
AA353093 

D63476 Hs.172813 
U20982 Hs.1516 
BE245380 Hs. 153952 
AW505214 Hs.155560 
AW250380 Hs.109059 
AF062649 Hs.252587 
AW021276 Hs.17121 
AI557212 Hs.17132 
AW337575 Hs.201591 
BE548267 Hs.337986 
AF167706 Hs.19280 
AW190920 Hs.19928 
BE048821 Hs.20144 
A1077288 Hs.296323 
AL050295 Hs.22039 
NH.001955Hs.2271 
NM_017413Hs.303084 
AJ271216 Hs.22880 
BE620886 Hs.75354 
AI824144 Hs.23912 
AF104266 Hs24212 
NMJ03155Hs.25590 
AW939251 Hs.25647 
X56681 Hs.2780 
T47364 Hs.278613 



T93500 

AA040311 

BE559681 



Hs.28792 
Hs.28959 
Hs.30736 



AA443966 Hs.31595 
AW960564 

NM_01655BHs.274411 
AI161383 Hs.34549 



AI359214 
AI267615 



Hs.179260 
Hs.38022 



BE379499 Hs.173705 
AI752235 Hs.41270 
AA235709 Hs.4193 
BE177330 Hs.325093 
AF112222 Hs.323806 
NM.003542HS.46423 
AA876616 Hs.16979 
AW163483 Hs.48320 
NNL001290HS.4980 
AA257992 Hs.50651 
M24283 Hs.168383 
AA160511 Hs.5326 
BE379595 Hs,283738 
BE218319 Hs.5807 
AA373314 Hs.5897 
AF234532 Hs.61638 
AA480074 Hs.331328 
Y00787 Hs.624 
BE384932 Hs.64313 
AW946276 Hs.6441 
W16518 Hs.279518 
AA026533 Hs.66 
AA370045 Hs.6607 
AW021103 Hs.6631 
AB037715 Hs.183639 
NM.006074HS.318501 
AA403045 Hs.6906 
AI147861 Hs.213289 
AW631255 Hs.8110 
AW103364 Hs.727 
NWL000499HS.72912 
BE294068 Hs.737 
BE547647 Hs.177781 
X83703 Hs.31432 
BE619053 Hs.170001 



connective tissue growth factor 
K1AA0948 protein 
tetraspan NET-6 protein 

Homo sapiens mRNA; cDNA DKF2p564F053 (fr 
tripartite motif protein TRIM2 
lysosomal 
metallothionein 1L 

PAK-interacting exchange factor beta 
InsuMke growth factor-binding prote 
5* nucleotidase (CD73) 
calnexln 

mitochondrial ribosomal protein L12 
pituitary tumor-transforming 1 
ESTs 

ESTs, Moderately similar to I54374 gene 
ESTs 

Homo sapiens cDNA FLJ10934 fis, clone OV 

cysteine-rlch motor neuron 1 

hypothetical protein SP329 

small inducible cytokine subfamily A (Cy 

semm/gtucocorticoid regulated kinase 

KIAA0758 protein 

endothelin 1 

apeiin; peptide iigand for APJ receptor 

dipeptidylpeptidase III 

GCN1 (general control of amino-acid synt 

ESTs 

latrophifin 

stanniocalcln 1 

v-fos FBJ murine osteosarcoma viral onco 

jun D proto-oncogene 

interferon, alpha-inducible protein 27 

Homo sapiens cDNA FU11041 fis, done PL 

ESTs 

KIAA0124 protein 
ESTs 

transmembrane 4 superfamily member 1 

SCAN domain-containing 1 

ESTs, Highly similar to S94541 1 done 4 

chromosome 14 open reading frame 4 

ESTs 

Homo sapiens cDNA: FU22050 fis, clone H 

procollagen-lysine, 2-oxoglutarate 5-dio 

DKFZP58601624 protein 

Homo sapiens cDNA: FU21210 fis, done C 

pinin, desmosomeassodated protein 

H4 histone family, member G 

ESTs, Weakly similar to A43932 mudn 2 p 

double ring-finger protein, Dorfin 

UM domain binding 2 

Janus kinase 1 (a protein tyrosine kinas 

intercellular adhesion molecule 1 (CD54) 

amino add system N transporter 2; porcu 

casein kinase 1, alpha 1 

GTPaseRab14 

Homo sapiens mRNA; cONA DKFZp586P1622 (f 
myosin X 

hypothetical protein FU13213 
interieukin 8 

ESTs, Weakly similar to AF257182 1 G-pro 
Homo sapiens mRNA; cDNA DKFZp586J021 (fr 
amyloid beta (A4) precursor-like protein 
interieukin 1 receptor-like 1 
AX1N1 up-regulated 
hypothetical protein FU20373 
hypothetical protein FU10210 
Homo sapiens mRNA full length Insert cDN 
Homo sapiens cDNA: FU23197 fis, clone R 
low density lipoprotein receptor (famifi 
L-3-hydrcwyacyl-Coenryme A dehydrogenase 
inhlbin, beta A (activin A, actMn AB a 
cytochrome P450, subfamily i (aromatic c 
immediate early protein 
hypothetical protein MGC5618 
cardiac ankyrin repeat protein 
eukaryotic translation initiation factor 
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133510 AA227913 AW880841 Hs.96908 p53-induced protein' 

133517 X52947 NM_000165Hs.74471 gap junction protein, alpha 1,43kD (con 

133526 M11313 AU077051 Hs.74561 alpha-2-macroglobulin 

133538 L14837 NM_003257Hs.74614 tight junction protein 1 (zona occludens 

133562 M60721 M60721 Hs.74870 H2.0 (DrosophllaHte homeo box 1 

133584 D90209 D90209 Hs.181243 activating transcription factor 4 {tax-r 

133590 T67986 T70956 Hs.75106 clusterin (complement lysis Inhibitor, S 

133617 AA148318 BE244334 Hs.75249 ADP-ribosylation factor-like 6 interacti 

133651 U97105 AI301740 Hs.173381 dihydropyrimidinase-ltke 2 

133671 T25747 AW503116 Hs.301819 zinc finger protein 146 

133678 K02574 AW247252 nucleoside phosphorylase 

133681 D78577 AI352558 tyrosine 3-monooxygenase/tryptophan 5-mo 

133722 X53331 AW969976 Hs.279009 matrix Gla protein 

133730 S73591 BE242779 Hs.179526 upregulated by 1,25^ihydroxyvitaminr>3 

133750 X95735 BE410769 Hs.75873 zyxin 

133802 L16862 AW239400 Hs.76297 G protein-coupled receptor kinase 6 

133825 U44975 BE616902 Hs.285313 core promoter element binding protein 

133838 M97796 BE222494 Hs.180919 inhibitor of DNA binding 2, dominant neg 

133859 U86782 U86782 Hs.178761 26Sproteasome-associatedpad1 homotog 

133889 AA099391 U48959 Hs.211582 myosin, light polypeptide kinase 

133960 M19267 M19267 Hs.77899 tropomyosin 1 (alpha) 

133975 D29992 C18356 Hs.295944 tissue factor pathway inhibitor 2 

133977 L19314 AI125639 Hs.250666 hairy (DrosophilaJ-homolog 

134039 S78569 NMJ02290Hs.78672 iaminin, alpha 4 

134075 U28811 NMJ12201Hs.78979 Golgi apparatus protein 1 

134081 L77886 AL034349 Hs.79005 protein tyrosine phosphatase, receptor t 

134164 C14407 AW245540 Hs.79516 brain abundant, membrane attached signal 

134203 M60278 AA161219 Hs.799 diphtheria toxin receptor (heparin-bindi 

134238 R81509 AA102179 Hs.160726 Homo sapiens cDNA FLJ11680 fis, done HE 

134299 AA487558 AW580939 Hs.97199 complement component C1q receptor 

134332 D86962 D86962 H&81875 growth factor receptor-bound protein 10 

134339 AA478971 R70429 Hs.81988 disabled (Drosophiia) homoiog 2 (mitogen 

134343 D50683 D50683 Hs.82028 transforming growth factor, beta recepto 

134381 U56637 AI557280 Hs. 184270 capping protein (actln filament) muscle 

134403 M61199 AA334551 sperm specific antigen 2 

134416 M28882 X68264 Hs.211579 melanoma cell adhesion molecule 

134493 X15183 M30627 Hs.289088 heat shock 90kD protein 1, alpha 

134558 S53911 NWL001773Hs.85289 CD34 antigen 

134817 U20734 AU076592 Hs.198951 jun B proto-oncogene 

134983 D28235 D28235 Hs.196384 prostaglandin-endoperoxide synthase 2 (p 

134989 AA236324 AW968058 Hs.92381 nudix (nucleoside diphosphate linked moi 

135052 AA148923 AL136653 Hs.93675 decidual protein Induced by progesterone 

135062 AA174183 AK000967 Hs.93872 KIAA1 682 protein 

135069 AA456311 AA876372 Hs,93961 Homo sapiens mRNA; cDNA DKFZp567D095 (fir 

135071 L08069 W27190 Hs.94 DnaJ (Hsp40) homoiog, subfamily A, membe 

135073 AA452000 W55956 Hs.94030 Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

135170 AA282140 T53169 Hs.9587 Homo sapiens cDNA: FU22290 fis, done H 

135196 J02854 C03577 Hs.9615 myosin regulatory light chain 2, smooth 

1 35348 AA442054 U80983 Hs.268177 phospholipase C, gamma 1 (formerly subty 
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TABLE 4A 

Table 4A shows the accession numbers for those pkeys lacking unlgenelD's for Table 4. The pkeys in Table 7 lacking unigenelD's are represented within 
Tables 1 -6A. For each probeset we have Bsted the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the "Aceession G 
column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 



100752 33207 21 T81309 BE019033 R94181 BE019198 NMJ00612 J03242 AW411299 BE300064 BE297644 R94182 AW630108 T53723 
D58853 H78073 H80594 BE299560 T48899 H70196 M17426 N77077 S77035 H58384 H61664 H78540 T84527 C17198 
H60255 H71980 R92644 W79050 X00910 M29645 R91055 M17863 M17662T71815 BE299561 BE464561 X06260 
R94741 T54216 C18594 BE262015 X06161 AW409889 AA378400 BE263228 BE313278 R88116 BE313457 H43500 
T48617 BE313761 H77309 AI207601 X06159 H40413 X03425 T87663 R10627 X035S2 M14118 W03982 R97520 H81229 
T83157 H83168 H48762 AA669898 BE263054 H47289 AA022807 R1 1555 H74260 R76968 R28338 H72534 H72464 
H62031 N72478 N45355 AW411300 R89113 R69135 H58454 T83281 R93476 H69645 H68015 T82229 H71089 T65121 
H59939 W65299 N78176 H53909 N72373 R21788 H04660 H59639 H61874 BE262219 T53614 N73335 N50464 W00943 
N77189 R89257 AA570502 R89432 R06366 AA553480 AA776271 AA551359 AA551050 H51670 AA601052 BE299081 
H68198 H52276 BE207832 N91192 H70332 X07868 X07868 H69464 H53782 H73710 R80435 AA553384 AW8B4176 
N53475 T71662 AW954036 AW954033 AA552931 H93206 AA430218 AA553476 AI918470 T54124 BE207982 BE300177 
N73994 AW882625 N39549 N53838 AA722389 H71878 H58909 H37849 H78435 T47933 R77174 R83814 AA41 1890 
H94199 AA663208 BE205778 AA490137 H70492 R98232 H37800 AA679294 H40341 H74238 H47290 H73231 T4861 8 
AA025428 AI039521 H92969 N59389 H80538 H72933 T90630 AA411891 N55000 H74225 AA340290 AW957061 T54316 
AA340437 H57125 H58908 H79027 H63450 N74623 R93425 H68714 H6875B N68396 H48763 N69256 H57320 H53831 
H53589 N68833 N52453 H56048 H69870 H78074 R69253 R83375 T53615 H94330 H58455 H90864 T47934 H74261 
R89258 R97997 R91056 R28339 R86760 H78235 R97521 H67692 H40358 AA022688 H52513 H59601 T88690 H65256 
H63397 W65397 AA553588 R19280 N52645 W73930 R06367 R21743 H72372 N73921 AW883539 AW882639 T40616 
H47084 R95723AA634316AA862781 H77310R91389 H93111 R92767 T54512 R89341 H70333 H57817 H82941 H62032 
N52638 H58385 T91796 H51086 AA340292 T49918 H81230 R36121 N50411 T87664 N62436 N39340 AA665637 
AA340446 H93377 H92973 BE296290 BE269788 H61665 AA340444 N54605 AA454101 R10628 R94200 AI200549 
AA342640 BE298855 BE250229 T49916 H82008 N28278 AW880662 H7126B N76791 H47685 H65255 W05198 
. AW889144 N76677 H71702 H68036 H71915 R91612 R87807 H68059 AI133328 AI247866 AA621443 AW8B1050 
AA700847 AA340413 AW878608 AW881 181 AW878249 H71916 N54596 BE161581 AW878082 W04212 AW881040 
AWB85492 AW880519 AA334887 AW878715 W06882 AW630222 AW885381 H70869 AW381778 H47601 AW889982 
H63868 AW884986 AW87871 3 AW878685 R36391 AW878694 AA368070 C03393 AW878695 AW878705 AW878665 
AW878742 AW878620 AW878823 AW878688 R29048 AW878690 AW878686 AW878810 AW878827 AW878733 
AW878659 AW878749 AW878681 AW883353 AW883277 AW883300 AW883565 AW8B3298 AW883143 AW883045 
AW883482 AW883352 AW883417 AW883357 AW883231 AW883474 AW883355 AW882620 AW882533 AW883754 
AW883139 AW882827 AW883641 AW883567 AW883481 AW882983 AW882982 AW882465 AW883419 AW882466 
AW883639 AW883230 AWB82981 AWB82534 AW882874 AW882619 AW883480 AW882826 AW882831 AW882835 
" AW882830AW883563AW882456AW627642 

1 17156 145392.1 W73853 AA9281 12 W77887 AW889237 AA148524 AI749182 AI754442 AI338392 AI253102 AI079403 AI370541 AI697341 
H97538 AW1B8021 AI927669 W72716 AI051402 AI188071 AI335900 N21488 AW770478 W92522 AI691028 AI913512 
AI144448 W73819 AA604358 N28900 W95221 AI868132 H98465 AA148793 

131859 3672J AW960564 AA092457 T55890 D56120 T92525 AJ815987 BE182608 BE182595 AW080238 M90657 AA347236 AW961686 

AW176446 AA304671 AW583735 T61714 AA316968 AI446615 AA343532 AA083489 AA488005 W52095 W39480 N57402 
D82638 W25540 W52847 D82729 D58990 BE619182 AA3151B8 AA308636 AA112474 W76162 AA088544 H52265 
AA301631 H80982 AA1 13786 BE620997 AW651691 AA343799 BE613669 BE5471 80 BE546656 F1 1933 AA376800 
AW239185 AA376086 BE544387 BE619041 AA452515 AA001806 AA190873 AA180483 AA159546 F00242 AI940609 
AI940602 AI189753 T97663 T66110 AW062896 AW062910 AW062902 AI051622 AI828930 M102452 AI685095 AI819390 
AA557597 AA383220 AI804422 AI633575 AW338147 AW603423 AW6068O0 AW750567 AW510672 AI250777 M083510 
AW629109 AW513200 AA921353 AI677934 AI148698 AI955858 AA173825 AA453027 AI027865 AW375542 AA454099 
AA733014 AI591 384 R79300 R80023 AA843108 AA626058 AA844898 AW375550 AA889018 AI474275 AW205937 
AI052270 AW388117 AW3881 11 AA699452 AI242230 N47476 H38178 AA366621 AA1 13196 AA130023 H39740 T61629 
AI885973 AW083671 AA179730 AA305757 AI285455 NB3956 AA216013 AA336155 AW999959 T97525 AA345349 
T91762 AA771981 AI285092 AI591386 BE392486 BE385852 AA682601 AI682884 AA345840 T85477 AA292949 
AA932079 AA098791 D82607 T48574 AW752038 C06300 

125555 1704098J R20840R20839 

133607 1227_6 BE273749 BE397561 BE387189 AL037858 AL037878 A1963094 BE259216 AA011363 AL036189 BE562325 AA251 169 

BE617431 N98537 AA158093 AL047800 M34539 NM_O008O1 AA312140 D16971 AA158904AA307114 AA312803 
T09203 AW629686 AL048504 BE388578 AA220957 AA158364 BE267385 AA294971 C18055 BE241757 AA1 15056 
AI936769 BE378435 BE206971 AW674924 BE622060 AA604674 AA115273 AW402159 AA338608 BE568819 M80199 
X55741 AA375111 AA376016 BE612671 AA805742 AW405588 N25850 N44580 H06031 AW403549 BE536552 AA056726 
BE543239 AA082517 AI201645 AI201642 AI192622 N40104 AA370921 BE547569 AI969S02 AA302038 AI197890 
AW268354 AI014938 W45448 AI541395 AA037272 BE538826 AL039613 BE536130 AA299355 AW805147 AW974624 
H53220 AI471471 AA399303 AA007386 W35106 BE613277 R12739 R12738 AA304342 AA687802 BE409581 AI498844 
AV662092 AW904105 AA01 1375 BE315214 H99302 BE537893 N32299 AW855829 AI291320 BE078322 A1301395 
AA303362 N32719 AA358328 AA357877 AI952540 H56279 H02758 H02048 AW805233 R82224 AA410772 AA291352 
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BE171 109 N69935 BE169248 AA361 173 H44978 BE617887 D52560 M084043 W03595 R67219 N36477 N42924 R67104 
H44901 H79695 W21105 AA393988 W30B99 AA31 6098 BE622896 W46B72 M442678 BE544893 BE5401 12 BE621873 
AA338067 N55052 BE398154 BE621210 AA740760 C03739 C03208 BE396692 AA482370 AA031614 AA301575 
AA304710 AA132153 AA029796 AA994960 H19567 AA442959 H49781 H46871 AA035395 AA056185 AA149378 
AA643080 AL135479 AA292329 AA654337 AA041228 AA454888 AA025039 W58331 AA625981 T94941 AA302448 
H19900 AA21B956 AA513790 AA563982 AA398076 W44441 AA293276 W47373 AA625879 W30688 AA043029 T64284 
R79151 AA304340 AA485186 AA604939 R82470 AA421425 AW771456 AJ339329 AA304424 AA605236 AA936934 
AA587673 AI209162 AI697301 AW79995 AI679814 AI361950 AW189125 AI955888 AI986019 BE301019 AI084792 
AI310211 AW189307 AI022070 AW977204 AI146825 AW190163 AW303281 AI828345 BE046043 AW029257 AA482268 
AI246507 AI420729 AW084932 AW439514 AI890487 AW439692 A1523896 AI186612 AI659953 AI889773 AA687527 
AW072694 AW262153 AW467371 AI613269 A1679238 D54404 AA158103 AW105527 AW149739 AW150361 AW268387 
AW1 17708 A1951682 AI687440 AW674285 AA678365 A1587082 AA732095 AA019899 W45661 AA627300 BE613304 
AA765891 AA612935 AI814658 AW316916 R66594 AA514640 AA025040 AA031472 AW732076 AA029797 AI244560 
AI128734 AW381720 AI092360 AI253283 AW813175 AI890675 AI720156 AW631348 AI635106 AI278045 AA303979 
AA703505 W45449 AW078661 AI292052 AW381707 AI147854 AW3B1743 AA158905 AA303258 AA888144 AW195967 
AA428706 AA989559 AA617731 H19882 BE543418 AA830386 AA421302 W58652 T94995 AI869743 AI679145 
AW085971 N98425 AA765136 AI347027 AI356955 AA928038 AI679717 AA458459 AA679281 AI367973 AI270041 
AA765135 AA732793 AI798447 AA668646 AA251008 AI984538 AI401737 AA056186 BE043308 AW662375 AI3021 10 
N50724 W96332 BE537047 N26983 A1567172 AA765296 AW673237 N29784 AA534275 AA0B4044 AW067973 
AW300766 T63398 W46823 R39790 AI364185 AW298582 AA454814 AW069878 N67751 H05982 N23140 AI362647 
AI302086 AI767772 N25755 H531 14 AA706133 T9351 1 AA429291 AA935294 AA987647 W02803 R66595 AI680795 
W23673 AW440794 AA722872 H49538 AW131042 AA531603 AA908665 AA040791 AA235312 W52205 N93444 R82180 
H02759 H79696 AW088894 H56079AA961143 AW067776 AW973745 AA016311 AW071227 AA017511 AI753994 
W47374 T64155 AA296092 AI698626 AA558158 AA296088 AW794259 H01963 AA149267 AA485076 AA975856 H44938 
AA035396 AI955555 H46289 AA486161 AI631222 AA359047 AW794253 AI806962 AW243930 AA526145 AW878734 
AA018464 AA132031 R67220 R79152 AA296093 H54300 AI005160 BE242548 AW992803 AW878644 AW878666 T27742 
R82471 AW517604 AW472738 AI282904 R39791 AA486098 AW467891 AW960520 AA551736 AA056621 AW945197 
R66373 AA554236 BE242202 AI904376 AI832590 H19484 R00890 AI627677 AA302287 AI869451 A1734855 AI708073 
AI832902 AA585184 AW204299 AA055565 D12417 D11975 T63543 AW664099 R54423 BE612712 T96340 T63985 
AA598917 T40735 T64053 AA149284 AW272548 AA363445 AA042893 AW300697 BE261973 T53501 T53500 AW878729 
AW878657 AW794391 AA069193 R01553 H44875 AA385406 AA533968 M93060 AL135600 W96331 AA017651 
AA018849 AA017692 H85337 BE278690 AA73159B AA018512 AI076813 AI022644 R02585 X52220 AW296894 AA825671 
AI699321 AI393501 AW592611 AI146747 AA608921 M158365 AW590007 AA354519 D20081 R02704 AW798339 
M92422 AA094903 AA007676 

133681 13893 1 AI352558 Z82248 X78138 NMJJ03405 AU077248 AA223125 S80794 D78577 A1124697 AW403970 BE614089 BE296713 

BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA084159 BE538808 AW401377 AA256774 C03366 W46595 W47608 AA305009 H69431 H69456 AL120082 
H1 1706 AA303717 AA361357 H22042 H78020 AW999584 AA134368 AA32291 1 AA322961 H60980 N85248 N31547 
H79624 T1 1718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA608688 H82692 BE155392 AW383636 BE155394 AA487004 AW383504 AI342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25364 AA367935 A1559406 AA033522 AA374252 
AW835019 A1922133 AI697089 N99662 AW189078 AI199076 AW151598 W59944 AA662875 W94022 AA299055 
AI039008 AI829449 AA583503 AI635674 AW131665 A1473820 AW2731 18 AW900930 AA908944 AI688035 AW170272 
AI082545 AW468176 AI608761 AI082748 A191 1 682 A1248943 AI831016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905195A1 191245 AI470204 AI188296 AI421367 AI125315 AI087141 M629032 AA740589 AI554181 
AA150830 AI248541 AI077943 AA775958 M864930 AI261476 AI123121 AI310394 AA862331 AA872478 BE537084 
AI20560S AA720684 AI872093 AW150042 AL120538 AA219627 AA988608 C21397 AI359337 H25337 AI089749 
AA605146 AI359620 AA150478 A1359738 AW383642 AW995424 AI766457 R56892 AI089839 W61343 N69107 W46459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA1 26874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 L08439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA098126 T28031 AA491574 R84813 AA774536 AW383522 M155615 AW383529 
AA491520 AW028427 AA171496 AI469689 AW664539 AI811102 AI811 116 BE464590 BE350791 H76021 T15405 H21979 
AA219489 H13301 AA505883 AI864305 A1423963 AW084401 F04963 R69858 H67097 AI917740 AI655561 H69864 
AA033631 AW383484 AI886261 H25293 AA513281 AW271187 H11617 N79982 AM 74338 AI904207 AI904208 BE614558 
W94127 W65436 AI272249 AA700018 AI579932 AI085941 AW152629 

134403 17037.1 AA334551 BE008229 AA307537 AW961156AW995894 AW995826 NMJ306751 M61199 AA045603 AL036372 AV645606 

AI688095 AW351901 AA101337 AA101345 N73342 BE018030 BE569044 AW841975 AA373388 BE090412 H95440 
N53845 R67867 AA093441 AA363427 H9370B AW023134 AW994986 AW994989 BE090429 R23614 A1567932 H03726 
H01 101 H01867 AA548743 AI671806 AW872949 AW872941 AA742447 AI199788 AA045604 AI837465 AI741796 
AW242217 AW131463 AI765302 AI683923 AA889762 A1804889 AI986437 C06049 BE502340 AI695651 A1491970 
AA496804 AA281008 AA665699 AI473814 BE301445 AA707837 AA551925 AI017348 AI208185 AA775203 AA156296 
AA557463 H95441 AA766547 AW769358 AA991 197 AA181954 AI091389 AI147289 AW771837 AI638582 AA84441 1 
AI374750 T29320 AW951272 AW085923 H02834 AA843259 AA814696 AW183290 AA158453 N68125 N69039 AA100423 
AA101346 AI918720 H01102 R67868 H01868 N66438 R46580 AI858433 AA599560 AA187577 AA157481 AA361520 
AL047827 AA158452 R21688 AW964874 AA325161 R40871 AW752395 AW375924 R13355 AA281174 AA428908 

126872 142696J AW450979AA1 36653 AA136656AW41 9381 AA984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE01 1359 
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 

121335 279548J AA404418 AI217248 

130018 18986 1 AA353093 AW957317 AW872498 A1560785 AI289110AW135512 X97261 T68873 

121822 24439? 1 AI743850 N49543 AW027759 BE349467 AI656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI61 1424 
AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AIB11611 AI61 1407 AI61 1452 W56437AI284566 
A1583349 AW183058 A13080B5 AI074952 AA437315 AA628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AI432414 AI984994 AI539335 AA401550 AA358757 AI609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 AI580502 A1832944 AI038530 AA425107 AI014986 AI148349 AW237721 AW779756 - 
AW137877 AI125293 AA400404 R28554 

123523 genbankJ\A608588 AA60858B 
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123533 genbankuAA608751 AA608751 

125091 genbankJ91518 T91518 

123964 genbanK_C13961 C13981 

102491 entre^U51010 U51010 

1 18475 genban^N 6684 5 N66845 

118581 genbank_N68905 N68905 

113947 genbanIeW84768W84768 

101447 entre^M21305 M21305 

101667 1 3349 1 NM.005381 M60858 AW373732 AW373724 AW373689 AW373629 AW373609 AW373776 AA187806 AW386946 

AW374207 T05235 AA216203 AW385556 AA306940 AA306526 AA315461 AL036757 AW37371 1 AW403124 AW403640 
AW377084 T27360 H62638 F06957 AW377051 AA554779 AA378568 AA09B007 AW352407 AW302637 F07929 H17433 
AW382712 H05665 F07292 N39875 AA089729 H62556 N42842 R12952 AW373735 AW364155 AA056183 W39185 
AW382708 N32488 AF1 14096 AW375993 A1133569 W52561 AA603040 AA133710 AI928796 AW176370 AA827519 
AW338437 AA521 142 T29341 AIB00461 AW317002 AA703914 AA860830 AI859203 AI445772 AA714334 AI817066 
AI832027 AW510442 AI635802 AW088306 AW068672 AW408555 AW457542 AA552657 AA152367 W32081 AA582124 
AA074040 AA931657 AI051154 AW410203 A1921644 H17434 AI832330 AW404836 AI925038 AA088423 AA954166 
AA580453 AW021292 AI267215 AW080082 AW383778 AI933053 AI919097 W31557 N90245 AA931591 AA563395 
F36352 AA056184 AA476294 AA641327 AA533550 AJ74SB30 W58323 AA5691 19 AA508573 AJ809050 AI378996 
AA411362 AW407505 AA938104 AA074041 AA632876 AW193748 AA507873 AI270128 AI472365 AA41 1363 AI523216 
AI719965 AI816302 AA182681 AI707990 AA133588 AI758537 W60253 AI460308 AA135423 AI083904 F04188 NB9693 
AW408776 AI678595 AI270568 AA722059 W58234 F33650 AA090547 AA285108 AA425981 N85079 D20218 AI273980 
AA159028 F03226 AW247914 N26918 AW272741 N90109 H05666 N23327 AW247953 R44746 AA962015 F03558 
AI752394 AW409913 AW248396 AI816463 AI752393 AA325370 AA263089 AI570130 AI971951 AI160658 A1357360 
AW168686 AL121075 AW050536 N21672 W67748 AA514242 AI127386 H14607 AI185752 W79364 AA088520 AA152476 
AW351940 AW373683 A1940524 AW374953 T5650O N24329 AI940720 AW374933 AW374947 AW391913 AL138337 
AW376241 AW062943 F26666 AW410202 AW062958 F34529 AW381807 AW393315 W17147 AW176359 AA664576 
AW380424 AA306040 AI745674 AW300951 AI188579 AI438973 AI305271 AA433818 AA612807 AB31809 AI940409 
AA158663AI572988 

.108931 genbanleAA147186 AA147186 

103138 entrezj<65965 X65965 

103432 entre2LX97748 X97748 

119174 genbank_R71234 R71234 

133678 11235 1 AW247252AA346143NIVL000270AA381085N91995 X00737 M381079AA296473AA296110AA315735 M3116 

AA326750 AA376804 AW403290 T95231 M13953 T47963 H82039 AA279899 AA627997 N76320 N99527 H37842 
W20095 AA457308 AW469547 AA724143 H83220 AA319496 W86334 W30892 R89169 R99427 N41854 H47286 
AA348094 AA045089 R63016 AI922219 AI024906 AI096488 AI885005 AA1 94872 N90489 AI452544 H7241 1 AA282427 
AA430735 R68963 R22453 H70385 AW129369 AW467320 AW519082 AA345018 AA582183 AI961789 R65918 N3061 1 
■ AI979189 AI280889 AW273191 R66531 AI285845 AI675927 AI421990 AW190879 H37794 AA699667 H68427 AA954388 
AI188757 AI140048 AA430382 AI204151 AW247864 AA559099 AI431420 AA548276 AI149466 AA772669 AA694388 
AA724168 AA301651 AA281952 AA779925 AA234760 W86290 AA913603 AW51 1745 AI500697 AA814922 AA835040 
T47964 H53998 AA975804 R98710 AI077604 N70252 R98084 AW250171 H69268 AI597614 AA970746 AA972546 
AI377116 R62962 H16737 R89070 AA731329 R66532 N54354 AI81B832 H81944 N71567 T95122 W86463 AA437095 
AI431999 Af9l5724 N63851 AI674743 AA457307 AA21 1475 N64444 AI799146 H72853 R99335 H60413 AA770367 
AA156105 AI269937 H64029 H89728 R65819 AW470496 AI873318 AI735713 H82987 C02447 AI478666 T27651 
AI699770 AW025156 H69719 AI984717 N69225 AI459856 AA953577 AI424691 H13843 R22404 AI873796 AI336002 
N70898 AI420854 AA541792 AA346142 AI000814 AI828348 AA045090 T51257 N90434 H13890 N73184 AI708083 
AA781608 AA329050 AA339985 R68964 H64795 W04186 H16845 

119416 genbankJT97186 T97186 

119559 NOT FOUND_entre*_W38197 W38197 

123473 genbank_AA599143 AA599143 
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TABLE 5: 



Pkey: Unique Eos probeset identifier number 

Accession: Accession number used for previous patent filings 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigertelD: Unigene number 

Unigene Title: Unigene gene title 



Pkey Accession ExAccn UniGene UnigeneTitie 

115819 AA426573 AA486620 Hs.41135 AA486620 

132837 D58024 AA370362 Hs.57958 AA370362 

101545 M31210 BE246154 Hs.154210 BE246154 

102898 X06256 NNL002205Hs.149609 NWL002205 

101192 L20859 BE247295 Hs.78452 BE247295 

102915 X07820 X07820 Hs.2258 X07820 

105330 AA234743 AW338625 Hs.22120 AW338625 

107385 U97519 NM_005397Hs.16426 NMJ)05397 

102024 U03877 AA301867 Hs76224 AA301867 

134416 M28882 X68264 Hs.211579 X68264 

103036 X54925 M13509 Hs.83169 M13509 

104865 AA045136 T79340 Hs.22575 T79340 

106124 AA423987 H93366 Hs7567 H93366 

105330 AA234743 AW338625 Hs.22120 AW338625 

109001 AA156125 AI056548 Hs.72116 A1056548 

104764 AA025351 AI039243 Hs.278585 AI039243 

133200 AA432248 AB037715 Hs.183639 AB037715 

105263 AA227926 AW388633 Hs.6682 . AW388633 

105178 AA187490 AA313825 Hs.21941 AA313825 

109456 AA232645 AW956580 Hs.42699 AW956580 
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TABLE 5A 

Table 5A shows the accession numbers for those pkeys lacking unigeneltys for Table 5. The pkeys In Table 7 lacking unigenelD's are represented within 
Tables 1 -6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each duster are listed In the 'Accession" 
column. 



Pkey: Unique Eos probeset Identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
115819 



CAT Number Accession 



10241 1 AA486620 AF205940 AA297524 AB034695 AA081335 NM.016242 AA188323 AA297537 H88204 AW953081 W31695 

AW582203 AA248250 AW68121 1 AA426230 AA464807 AA426155 N44141 AA347390 AA770661 AI333225 N36136 
AW665724 AA431894 AI374976 AI400254 AI338446 AA186695 H88205 W04527 AA487066 AI051414 AA918383 
AA426573 AA425620 AW438654 AA090513 BE167284 BE167291 AI301726 
102024 14505 1 AA301867 AW957981 R27614 AA155808 AI920990 AI74071 1 AA301026 AA301015 AI220981 AI857670 AI537140 

AW015210 AA030000 W46890 H44021 AI355967 AI651735 AA058479 AA146932 T58265 R85890 AA047810 AA017387 
AW026093 AA971133 AI827263 AI056416 AI355994 AI127691 H46603 U03877 NM_004105 AA157357 H42844 
AA146B24 AA187709 AA187259 AA304348 AA147292 AA361687 AA156041 AA330636 R32929 AA321 130 AW950250 
AA082157 AA029129 AA303708 AA028155 D31561 T84689 AA302493 BE153057 BE153181 W39408 AA187200 
BE153250 AW383337 AW382622 AW382647 AW750072 BE153060 AW382630 AW371865 AW392464 AW382664 
AW382658 AW382650 H61647 AW365075 AW355049 M373397 BE072779 BE072781 Z30254 W24381 BE153254 
AA040442 BE072729 BE072731 N94740 AA146945 AW802737 AI826799 AI085395 R34034 H65140 AA082800 H88275 
AA147824 R63882 W80899 AA296413 AI765300 AI862426 AW022055 AW300003 AI743784 AI862635 AI9B5428 
AA147764 AW573245 AW190290 AI040898 D57613 N63457 AA148082 A1028458 AA1481 10 AW814489 N75105 
AW629443 AA704122 AW582220 AA181240 AA057495 AI418224 AI261751 AW388595 AI472205 AW470672 AA102546 
AA789046 AA182416 AA062668 AW300732 AI288220 AA181982 AA146825 AA028130 AI985522 AA303344 AA081313 
N69082 AA182035 AI8671 28 AA100902 AA605087 N67178 AW020324 AW890446 AI472191 AI335691 AI597837 
A1081143 AI335681 AA040443 AI1 28067 A1678244 AA018303 AA1 57260 W80792 AI934590 AI096430 T54343 AI446350 
AA165196 AA780683 AA603631 M047787 AA968580 AA912645 AW890504 AW02691 3 D56983 H52088 AA156121 
R30848 AW023036 AI590960 N67345 AI753225 AI753283 AI183768 AA147818 H89101 AI362141 H89205 AI14771 1 
AA321129 AA668622 AA343479 AW069438 AI422376 AW629270 AA013413 AI221948 AA970605 N52335 H38366 
T91180 AA657841 AA017386 AA152227 AA187593 AI913340 A1719313 AI969943 AI701271 A1004328 AI868348 N93659 
H65093 H25736 D57007 D56957 C00987 D61839 D56661 AI472137 AI971002 D56971 BE048830 D57972 AI589286 
AI361055 AI361071 AI292223 AA155898 D57139 D57981 D57345 A1420034 D57332 D57959 AA875933 R33493 N67558 
D58353 AA188394 AA147966 AI160640 AI363165 H40638 AA578137 AW950265 AA300943 AI128999 H46584 AA917355 
N57820 AA320504 H51959 H25737 

101545 24607 1 BE246154 M31210 NM 001400 AA193392 NM016537 AF233365 AF022137 H27787 AA370448 F05373 T27666 W21494 

AA036907 AI249966 N93476 F01623 AA304390 AA308808 

109456 180633J AW956580 AA886361 AI147670 AI090115 AI166683 AA232645 H99504 AA374707 AA380875 AW139567 AI735132 
BE439385 AW629780 N28322 AA232789 AA232790 N73285 

103036 17145 1 M13509 X54925 NM_002421 M16567 X05231 M15996 W39354 AA186634 AA852324 AA187507 AA081 149 AA186524 

AA187264 AA187361 AA386155 AA186973 AA374217 U78045 AA081230 AA188049 AA186393 W56827 AA852602 
AA157468 AA308204 AA186754 AA186808 AA08251 6 AA304334 AW376428 BE4393B4 AW376420 AA1 56273 T 18504 
AA186521 W49496 AW084608 AA083575 AA372360 AW963590 AA132297 W47445 M186376 AA157628 AW003999 
AI037890 AI858060 AI589010 AI743739 AI452673 AW304188 AW1 17854 BE439933 AA157416 AW778966 AI038497 ' 
AA081006 AA100829 AA181048 C02231 T27821 W23960 AW954802 A1471432 AW801 296 AW8012B9 AW801603 
AW801523 AW801292 AW801542 AW801601 AA181134 AI445147 M191501 AA582862 N94407 AI147810 AA1818B0 
W49497 W52714 AA188249 AI932881 AI082493 AA503656 AA182682 AW801393 AA182830 AA181882 AA182826 
AI613182 N94510 W47343 AI085755 AI076956 A1918426 AA0B1208 AI282835 AA147528 AI081490 AI654536 AA181875 
AA081282 AA186389 C06085 AA063542 AI800644 AA157642 AA101069 AA157752 M158121 AA143331 AA081283 
AA852603 AA188296 A1932880 AW449628 AA187348 C02091 AA514656 AA082736 AA308786 AA143201 M16567 

133200 28960 1 AB037715 AI351347 AI375798 AI884765 AL121124 W01068 AI807275 T95240 R42807 AW515645 A1057314 AI033520 

AA057671 N70215AA054215AW204183AA552149 T95130AW796310 AI866520 AW275564 AW796308 AI637901 
AW197404 T78406 AA456232 AW206463 AA779800 AI052698 M026744 AA454623 AW470729 R45490 AW770258 
AI038393 A1290170 AA722734 AL121 125 R41608 AI862414 AA83861 1 R45582 AI278083 BE466849 BE219944 
AA418030 BE041555 AA578572 T16528 AW006344 Z39782 AI244848 AW1 37344 AA707400 AI032028 BE540464 
AID94265 A1184281 AA931890 AW382744 AW382729 AW020448 AW827237 AA431226 AI672059 AW772345 N70172 
AW022003 AI862704 H19344 R6151 1 AI080204 H16566 AA432248 AI767980 T16688 AI984342 AI217478 AJ767095 
Z38551 AI359566 AI361437 AI041000 R07033 H16608 H19054 R12874 R61567 N98368 BE221 199 242320 AA094554 
R07078 AW860886 AA418090 R41262 

1 32837 256666 1 AA370362 AA3641 10 AW959554 AW371737 AW382068 AW604716 AW604713 AA487827 AW371674 AA429137 

BE503321 T93570 W72803 AI093076 AA487977 AI241562 BE439445 AW204065 R51635 A1802994 T10362 W68553 
AI866215 AW152154 AA700716 A1127443 R15824 AI537587 AA9531 10 D58024 AI52081 1 AA693670 A1453280 W76329 
AW023955AW022563 

102898 24023 1 NM 0022toX06256M13918BE070*66AW239485AW996127BE 

AA3"01 142 AA301 165 AW960506 BE272819 AA386086 T39391 AA285303 AA370580 D58585 T58668 AA156213 W24142 
AA343323AW796067AA151197AA376121 R94782 AA302363 H90357 R82621 AA301677 H55997 AW796059 W92358 
AL046458 AA471 198 AA301952 R46287 R82694 H03186 AA187706 R32562 R27094 R25947 R25320 AW949809 H13505 
H79049 R32403 H11213 R39710 H49765 H21 142 H21006 AA417664 W52075 N56771 AA284240 N98556 N30907 
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AA707335 AW603781 AI340367 AI814584 AA524182 AA370076 AA418785 AA704082 A1806851 H25513 T56388 
AA419627 H03986 H20963 T56245 AI459715 AW973768 AI334096 AI693020 T63414 R82646 AW167251 H55998 
AI274916 AA778367 AI755253 AKJ33667 AW083222 AA181979 R26865 AA661627 AA705329 AI798648 AA612799 
A1160180 AI274973 AI039264 AA301880 AI042429 AA307632 AI085688 A1278366 AI498890 AA303865 AI954844 
AA502380 AA156334 AA723480 AI803584 AI581026 AA304584 N51038 R94702 R69814 AW150962 AI570049 AA588807 
AA151198 T53400 AI567709 AI1B5326 AA309205 AW338969 R53903 AA991891 AA301643 AK93337 AI026049 H25514 
AJ741075 R2B632 AW166445 AI333068 H49978 H91 267 AA558193 AW079663 AA627380 AA807401 AI199956 AA6661 18 
AI718216 AW193228 AI077745 AI5Q0496 AI265059 AW080383 R06468 R26757 R32404 AA71 6599 W92322 AI077734 
AI270181 R46198 AI217540 AA304045 AA305421 AW074445 AI468256 AW089568 AW571605 BE162930 H41009 
AA578313 AW874497 AA1812B4 AA861947 T29451 D20841 T58618 AA418731 AI282500 AW081407 AA604560 
AA729855 AI262538 AI580225 

102915 2903 2 X07820 NMJ502425 BE271570 AI263526 AW296143 A1829878 AI973162 AI085155 AA857496 AA709305 C02220 

134416 30694 1 X68264 NMJJ06500 AF089868 BE257461 BE275425 AW997154 AI902799 AI902803 M78206 AA085691 AW392972 

AA325490 BE006161 AA349269 AA323568 AL042548 AA191148 AA187703 AA322791 AJ297452 T11625 AW366487 
AA303513 AA186961 AA1734B0 N28330 N28379 W40320 AA1871 18 H03695 AA402709 BE407476 H06354 BE276589 
AA351284 AA379921 AL138060 BE41 0587 AA1 13094 AA340481 BE277483 R21191 R79518 N86170 AA320505 
AA296055 AW951900 AA658897 AA650052 AA654304 AA191691 N26649 AW080963 AI265800 N72019 AI453458 
AA092563 AA402310 AI439450 AI061054 AA302358 T71566 AA302047 AA303432 N21289 H27357 AA303504 A1174583 
AW151762 AA181958 AW880618 AA630773 AI889539 AW901058 AI373405 AA341941 M086217 AI675590 AI653936 
AA633570 AA987619 AI270856 N93847 N40689 AW517517 N20030 W95985 AA303955 H89170 AA309917 N21642 
AA373132 W38517 AI687806 W76182 AA101065 AA036916 N45635 AI744510 AI669803 AI039157 AI126355 M634607 
AW131 120 AW196838 AA190601 AA91 1 130 BE221320 N92355 AA036752 H03696 M588873 AI456868 AI041618 
AA090477 AI093248 AA304755 AL137942 AL044688 AI083709 AI150965 N88891 AA635675 AA594898 W94657 
AA182823 AW166205 F27886 R79246 F37329 AA565697 AI075739 AI088654 A1094287 AI204256 AA095203 T93020 
AA688298 AA057324 N23442 AA075411 AA305046 AI031688 AI191503 AA1 11887 AA1 12264 N27929AA1 87509 
AI375522 AI474006 H06297 AI826177 N48880 H28333 AA075490 R22809 W79542 AI055934 AA042901 AA1 73481 
AA301986 W74531 AI051747 AA187715 AI888888 AA993017 A1057530 T92954 N80227 AW273595 AI351260 AW170643 
AW292979 AA302605 AA302330 BE349495 AA328602 AA302361 AI470984 AA155943 AA155914 
105178 7792.1 AA313825 AW960347 AF22346B NM.016613 AA186345 AA186508 AA081195 AA147972 AA346943 AW961667 

AA187222 AA1B7207 AW371052 AW449751 AW748803 AW391606 AW371047 AW371057 AW3710B5 AW362895 
AW371092 AW377556 BE010930 AI016882 AA247878 C04398 C05158 F1 1398 AA188315 H23385 R55086 H15346 
AA029106 AA2281 14 H17005 F08498 Z43376 AA095582 AA055186 AA463361 R15218 AA299132 AW103578 W21538 
AA428131 AA1 87115 AA157197 M157167 AW371371 AA363562 AW965995 N55663 Z17878 AA228023 AI140342 
AA100927 AA496988 AA055917 AI089303 AW014967 AW090248 AW338371 AW131066 D62963 D79713 AI583950 
AI336781 AI500705 AI471485 AW090239 D79784 D61847 D62789 D61842 AI086327 AI273381 D61815 D63043 AI913548 
AI280560 AI510828 AA029996 C16343 C16513 AI075741 AW516308 AI804764 AA948068 AI356588 AW103452 
AW573063 Z39445 C16489 AI949870 F04712 AA147823 AW026284 AI151538 AA081303 AA613890 AI251865 AW086499 
AA9921 1 1 AI862091 AI373465 BE502094 AI922270 AA884288 AA157079 N56963 AW189145 AA428080 R55056 
AA884068 AW771716 AA186662 C16364 H15723 AI921181 AA156888 H17006 AA187490 AI400994 AA346942 H28533 
AW129047 R41656 H14636 AA995041 D58370 Z21131 D58186 AI383271 AA643977 D58044 AI934302 AW779425 
F09065 H14930 AA890693 H23274 

AW388633 AW378440 AW388283 AW388339 AW388333 AW388414 AW388413 AW388607 AW388453 AW388687 
AW388480 AW388591 AW388711 AW388511 AW388438 AW388570 AW388449 A1694383 AW237145 AI652991 
AI964041 AW366319 AW366321 AW961938 AW46921 1 AI634155 AI492186 AI624430 AI677965 N26502 AI963871 
AW378431 AW378421 AI015391 AW352126 N59336 AI352317 AW1971 13 N67998 AW778935 AI476054 AI206626 
R37116 R40211 AA227926 AA639698 R38073 AI001745 T32854 AI619649 AI423703 F10774 AW388815 T16595 H05894 
AW338625 R43226 R51640 AI307645 AI308100 AI085787 AI420357 AIB92610 AA877160 AI953366 AA234743 
AI039243 R68234 AA025351 AA971063 AI537757 AA025362 R81636 T86650 

T79340 A/74231 7 AW1 82676 AW451460 AI420964 R43284 AA088179 AW590886 AW269529 AA045187 AI521736 
AI827455 M045136 AW271709 AI004344 AA639631 AA744417 AA744218 M045357 AA045351 
H93366AI653547AA336265AW966175BE566451 R71178 AI630656 AA234331 N55039 AA305632 AW960431 R34044 
R32254 AW020970 AW451281 AW275041 AI636933 AI655640 AA423986 AA642466 AI684063 AI633876 AI624897 
AA814795 AW590328 A1889166 AW243541 AW39691 AW473445 AI475516 AA74 1 228 AI1 27534 AA165143 AI074714 
AI654076 AA400674 AI560249 N50709 AW438621 AI806810 AI434579 AI308184 AA423987 AI141272 AI565586 
AI338440 AA219628 AI246643 AI985809 AA724260 AA633988 AI364172 AI798439 AI650801 R33503 AI435891 
AA903649 T96161 AA665538 AA219620 AI309962 AA400707 BE247066 R32178 AI275962 AA661602 AW003197 
BE466649 AA831198 AI620052 AI825387 AI634037 AI670978 AI670979 A1655092 R32304 AA828858 A1382428 
AW023660 AA262892 T26891 AW089917 T26926 R32227 
107385 6976 1 NM 005397 U97519 AW899329 A1902387 AA077792 AA078525 AW376607 AA077946 AA070415 BE208721 AW167958 

BE293050 BE208240 AI648698 AA101314 BE393348 BE305122 AA077591 BE274036 AA313687 BE392220 BE378954 
AA171461 AA464821 AW938242 AW938224 AW938243 AW938232 AA147953 N64294 AA205218 AW305065 AW517478 
" AA307983 AA377023 BE563629 R99976 N80294 T87719 T87928 AA496849 AA486344 AA204938 AW370448 AA318242 

AW964384 H92423 W95317 BE378774 BE391156 AA349138 AA173095 AW513198 AA037672 AA148029 AA169726 
W04791 AA075508 BE382937 BE395034 AF139793 AA961734 N48612 H64714 AW151251 AI565113 AI565881 
AW087370 AA631 168 AA622014 AW513098 A1857810 AW152287 AI052596 AI983246 AA024856 AI912456 AI677938 
AW026403 AA972537 A1088497 AW999869 W94582 AI140166 AI160659 AI566868 AA101263 AW190390 AW166466 
A1401207 AI418156 AI625265 AI146298 AW008592 BE223020 N5B926 A1308797 AA037673 AI935992 AI304706 
AA024939 AI216569 AI610423 AI354621 AI500677 AI679389 AI799310 N64508 AI128756 AI679897 AW589535 
AA989333 AI500527 AA565479 AA913529 AI923295 F21691 AA989376 AI699064 AA902447 AI690910 AA772659 
. AA204983AI337895R99975H65205M340766AI339441A!913855M450293AW1^^^^ 

AI247108 AI371261 AI364987 AI280171 AI269104 AI86B756 AA909836 AA983640 AI973271 AA913092 AI868205 
- AI144112 AI190975 N58085 AI566638 N93405 AW150504 AW296846 AI687036 AA902984 AI824460 AI625047 AA653148 
AI61 1228 AW131922 AA8626B7 AA902519 C01732 AW796045 AL044660 
101 192 15367 1 BE247295 AW068092 AL041313 AA159244 NM.005415 L20B59 AL135570 W47073 AW516906 BE388271 BE408629 

W46972 BE293646 BE256647 AI075010 AL041095 AA285300 AL039560 AA368740 W26602 AA399344 AA039235 
W27631 AW834898 AW834914 R93390 AA378039 AV649660 T53674 N98824 AA399974 AW843378 AA368267 R08256 
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AV653575 R27900 N4821 5 AW366371 N45500 AV652967 AI889251 AI08Q457 N39021 AI738542 AW242849 AI857471 
AI859775 AI582830 R75850 N66564 AW341636 AI499006 AI887217 AW026694 AW1 82840 AA039313 AA831346 
AI393465 AW089210 AI743830 AA744243 AA401310 AW439758 AW088152 R93391 AA291379 AA225220 AW009358 
AI192879 AA291202 AI565089 AA225089 AA807688 AI05205B AI341641 AI066625 AA333864 AA159147 AI923912 
5 R75851 AI761143 AW768588 AA394195 AI288450 AW512564 AI452775 AI056520 AA468602 AA872566 AI434739 

AA291838 A1948623 AW768614 AI374753 AW068174 AA884908 AI199346 AI199347 W94946 AI159995 AA877642 
AI280846 AI307610 AA403310 R08205 AW182123 AI000999 R27808 AW026571 D20816 AI560350 T27667 AW960271 
AI174628 AI432042 AI424528 AA909562 T17342 AI783866 
109001 146370.3 AI055548 AW409843 AW263540 AA723669 AA909334 AA156120 AA157141 AA156125 AW409866 W19499 AA157229 
10 AW887435 
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TABLE 6: 

Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unlgene number 

Unigene Title: Unlgene gene title 

AUC1 : 70* percentile of average intensity (Al) for probeset at each of 2,6,1 5,24,48, and 96 hour timepoints minus 70* percentile Al at 0 hrs, 

summed over 5 experiments. 

AUC2: AUCW percentile of Al for aorta, aortic valve, vein, and artery. 



Pkey ExAccn UnigenelD UnigeneTitle 



Hs.293797 
Hs.249989 



314941 AA515902 Hs.130650 
327414 

321911 AF026944 
331578 AI246482 
332466 AB018259 Hs.1 18140 
313513 AW298600 Hs.141840 
320635 N50617 Hs.80506 
326230 

313556 AA628517 Hs.1 18502 
313665 AW751201 Hs.120932 
324852 AI380792 Hs.135104 
314372 AL040178 Hs.142003 
311877 AA084248 Hs.85339 
322262 AA632012 Hs.1 88746 
312173 AI821409 
319795 AB037821 
313350 AW591949 Hs.57958 
326759 

300318 AW444502 Hs.256982 
313978 AI870175 Hs.13957 
306840 AI077477 Hs.307912 
310272 AF216389 Hs.148932 
315044 BE547674 Hs.204169 
321325 AB033100 Hs.300646 
303251 AF240635 Hs.115897 
302378 AL109712 Hs.296506 
315060 AA551104 Hs.189048 
332048 AW337575 Hs.201591 
337214 

311598 AW023595 Hs.232048 
304782 AA582081 
312802 AA644669 Hs.193042 
302680 AW192334 Hs.38218 
317452 AA972965 Hs.135558 
318558 AW402677 Hs.146381 



ESTs 

predicted exon 

ESTs 

ESTs 

KIAA0716 gene product 
ESTs, Weakly similar to S59501 interfero 
small nuclear ribonudeoprotein poiypept 
predicted exon 



Hs.304471 
Hs.146858 



AUC1 

1038 
303.2 
429.2 
677.4 
395.2 
324 
394.8 
357.2 
433.6 
-83 
348.2 
-49.2 
-1309 
-247.8 
-1025.8 
203.6 
183.8 
1654.4 
-346 
576.6 
56.4 
-127.6 
-102.6 
1080.6 
1270.8 
915.8 



312149 T90309 Hs.269651 

319267 F11802 Hs,6818 

321510 H75391 Hs.255748 
326198 

315730 H25899 Hs.201591 

310442 AW072215 Hs.208470 

331237 W87874 Hs.25277 

300469 BE301708 Hs.233955. 
338316 

330968 R44557 Hs.23748 

331019 NlVL006033Hs.65370 

331261 BE539976 Hs.103305 

301822 X17033 Hs.271986 
325544 
328700 

322882 AW248508 Hs.279727 
336034 

316580 AA938198 Hs.146123 

309931 AW341683 

330692 R39288 Hs.6702 

319962 H06350 Hs.135056 

338033 ■ 

314943 Y00272 Hs.184572 

332640 BE568452 Hs.5101 
338158 
327036 



ESTs 
ESTs 

ESTs, Weakly similar to The KIAA0149 gen 
G protein-coupled receptor 39 
ESTs 

ESTs, Highly similar to AF1 16865 1 hedge 
protocadherin 10 
ETL protein 
predicted exon 

ESTs, Highly similar to AF116865 1 hedge 
ESTs 
EST 

semaphorin Rs, short form 
ESTs 

KIM protein (similar to mouse paladin) 
protocadherin 12 

Homo sapiens mRNA full length insert cDN 

ESTs, Moderately similar to ALUC_HUMAN t 1236.8 

ESTs 522.6 

predicted exon 269 

ESTs 796.4 
gb:nn32h08.s1 NCI_CGAPJ3as1 Homo sapiens 

ESTs - 349.6 

ESTs 638.6 

ESTs 360.8 

RNA binding motif protein, X chromosome 700.2 

ESTs 274.2 

ESTs 238.2 

ESTs 231.8 

predicted exon 581.6 

ESTs 281.6 

ESTs -213 

hypothetical protein FU21065 285 

hypothetical protein FU20401 26.6 

predicted exon 14945 

ESTs 975.8 

lipase, endothelial 201.2 
Homo sapiens mRNA; cDNA DKFZp434B0425 (f 

Integrin, alpha 2 (CD49B, alpha 2 subuni 3565 

predicted exon 1014.6 

predicted exon 627.4 

Homo sapiens cDNA FU14035 fis, clone HE 84.8 

predicted exon 782.6 

hypothetical protein FU12972 746.4 
gb:hd13dOU1 SoaresJJFlJLGBC_S1 Homos 
ESTs 137 
Human DNA sequence from clone RP5-650E9 



predicted exon 540.6 

ceil division cycle 2, G1 to S and G2 to -494.8 

protein regulator of cytokinesis 1 -600 

predicted exon 3115 

predicted exon 351.8 



AUC2 

9 

303 

42.9 

103 

395 

32.4 

395 

35.7 

12 

0.5 

34.8 

0.5 

0.2 

1 

1 

5.2 
18.4 
1.2 
1 

2.3 
0.4 
0 
0 

4.8 

5.3 

153 

4.9 

4.7 

26.9 

205 

316.410.5 

7.6 

63.9 

36.1 

6.6 

7.5 

23.8 

235 

8.2 

9.7 

03 

0.5 

03 

34.7 

1.8 

0.9 

478.613 

1.7 

9.4 

62.7 

5.7 

783 

13.8 

134.8135 
13.7 

14.6 0.5 
14 
1 
1 

31.1 
355 
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302655 AJ227892 Hs.146274 ESTs 180.2 18 

327568 predicted exon 229 22-9 

324801 AW770553 Hs.14553 sterol Oacyltransferase (acyKtenzyme 161.2 16.1 

317850 AI681545 Hs. 152982 hypothetical protein FU131 17 -690 1 

5 322818 AW043782 Hs.293616 ESTs 126.4 4.5 

324626 AI685464 Hs.292638 ESTs 170.2 17 

317224 X73608 Hs.93029 sparc/osteonectin, cwcv and kazaHIke d 40 0 

310955 AI476732 Hs.263912 ESTs 466.8 46.7 

315240 R38772 Hs.172619 KIAA1106 protein 277 27.7 

10 338388 predicted exon 267.6 26.8 

338442 predicted exon 256 25.6 

316617 AW247252 Hs.75514 nucleoside phosphorylase 1247.8 24.2 

338645 predicted exon 206 20.6 

313135 N58907 Hs.162430 ESTs 204.8 20.5 

15 324716 BE169746 Hs.12504 hypothetical protein DKFZp761D081 203.6 20.4 

330305 predicted exon 199.8 20 

308248 AI560919 gb:tq41g10j(1 NCLCGAP_Ut1 Homo sapiens 199.4 19.9 

308885 AI833240 gb:at76d10jc1 Barstead colon HPLRB7 Homo 198.2 19.8 

315622 A1796144 Hs.258188 Homo sapiens cDNA FU11674 fis, clone HE 191.2 19.1 

20 323675 R43240 Hs.272168 tumor differentially expressed 1 189.2 18.9 

312164 T91980 Hs.221074 ESTs 187.6 18.8 

300378 Z45270 Hs.235873 hypothetical protein FU22672 271.6 18.7 

317478 AI343569 Hs. 107000 Homo sapiens mRNA for WDC146, complete c 187 18.7 

317559 AW452344 Hs.129977 ESTs 184.2 18.4 

25 317207 AI873346 Hs.214505 ESTs 182.8 18.3 

334834 predicted exon 178.8 17.9 

320925 D62892 gb:HUM337C07B Clontech human aorta poIyA 177.217.7 

303289 AL121460 Hs.272673 hypothetical proteb FU20508 316.4 17.6 

328548 predicted exon 174.6 17.5 

30 317108 AA884000 Hs.8173 hypothetical protein RJ1 0803 172.4 17.2 

318013 AI188183 Hs.144078 ESTs 326 17.2 

314299 AW382682 Hs.154840 ESTs 170.8 17.1 

317702 AW173339 Hs.135665 ESTs 169.8 17 

316094 AW975920 Hs.283361 ESTs 169.4 16.9 

35 323706 AA377578 Hs.65234 hypothetical protein RJ20596 169.2 16.9 

325843 predicted exon 321.4 16.9 

. 316012 AA764950 Hs.1 19898 ESTs 1047.2 16.9 

309687 AW236154 Hs.77385 myosin p Iightpolypeptide6,alkali,smoothmu 168.2 16.8 

323329 AL134744 Hs.10852 ESTs 168 16.8 

40 312853 W05086 Hs.1 14256 ESTs 167.4 16.7 

313070 AI422023 Hs.161338 ESTs 298.6 16.6 

314096 AW977642 Hs.291742 ESTs 165.6 16.6 

338728 predicted exon 165.4 16.5 

316609 AW292520 Hs.122082 ESTs 165 16.5 

45 305989 AA888220 gb:oj15h01 .s1 NCI_CGAP_Kld5 Homo sapiens 164.6 16.5 

312642 AW052128 . gbwx26c02jt1 NCI_CGAPJ<id11 Homosaplen 164 16.4 

339236 predicted exon 163.6 16.4 

317058 AI217713 Hs.147586 ESTs 161.8 16.2 . 

311137 AW207582 Hs.196042 ESTs 582.2 165 

50 310178 AI936450 Hs.147482 ESTs 161.2 16.1 

320745 H51696 Hs.89278 hypothetical protein RJ1 11 86 161 16.1 

317336 AW014637 Hs.130212 ESTs 160 16 

309871 AW300366 gb:xs63b05.x1 NCI_CGAPJ<id11 Homo sapien 159.8 16 

302038 AC004076 Hs.129709 Homo sapiens chromosome 19, cosmid R3021 159 15.9 

55 332237 N52883 Hs.102676 EST 159 . 15.9 

312362 AW015994 gb:UI-H^I0jwbh-g-09^)-Ul.s1 NCI_CGAP_S 158.6 15.9 

331558 N62401 Hs.48531 EST 158.6 15.9 

316215 AJ684535 Hs.200811 ESTs 158.4 15.8 

336059 • predicted exon 157.4 15.7 

60 302790 AJ245245 gb:Homo sapiens mRNA for immunoglobulin 155.8 15.6 

328418 predicted exon 153.8 15.4 

304229 AK000149 Hs.29493 hypothetical protein FU20142 153.6 15.4 

331606 AW273285 Hs.50802 ESTs 153 15.3 

338962 predicted exon 664.4 15.3 

65 317959 AI204202 Hs.130264 ESTs 152.6 15.3 

336228 predicted exon 152.4 155 

313534 AW072916 Hs.78743 zinc finger protein 131 (clone pHZ-10) 152.2 15.2 

317404 AI806867 Hs.126594 ESTs 152.2 15.2 

311943 AI469911 Hs56498 hypothetical protein FU21657 152 15.2 

70 314680 AI247425 Hs.152182 ESTs 151.4 15.1 

331484 N29696 Hs.44076 EST 1515 15.1 

338116 predicted exon 1515 15.1 

329863 predicted exon 150.6 15.1 

315555 AW452886 Hs539107 ESTs 149.6 15 

75 317039 AA868583 Hs.126153 ESTs 149.6 15 

331138 R63816 Hs58445 ESTs 149.6 15 
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316561 AJ917222 Hs.121655 ESTs 149.4 14.9 

328695 predicted exon 149.2 14.9 

302282 BE396283 Hs. 1 73987 eukaryotic translation initiation factor 148.4 14.8 

318781 F11802 Hs.6818 ESTs 148.2 14.8 

5 323709 AW297246 Hs.288546 Homo sapiens cONA FU14190 fis, clone NT 148 14.8 

310790 AW192053 Hs.248865 ESTs 147.8 14.8 

316833 AW292614 Hs.124367 ESTs 147.8 14.8 

323176 NM.007350HS.82101 pleckstrtn homology-like domain, family 229 14.8 

324188 AW274439 Hs.252709 ESTs 147.6 14.8 

10 317441 AA922798 Hs.196583 ESTs 147.4 14.7 

317584 AI825890 Hs.220513 ESTs 146.8 14.7 

321798 AJ308206 Hs.181959 ESTs 146.8 14.7 

304363 AA206045 gb:zq77f05.s1 Stratagene hNT neuron (937 146.6 14.7 

313952 F20956 gb:HSPD05390 HM3 Homo sapiens cONA done 146.614.7 

15 301909 AI702609 Hs.15713 ESTs 263.8 14.7 

309196 AI904895 Hs.9514 nudeophosmin (nucleolar phosphoprotein 1462 14.6 

321860 N47474 Hs.212631 ESTs 146.2 14.6 

330187 . predicted exon 146 14.6 

323042 AA463571 Hs.172550 polypyrimidine tract binding protein (he 145.6 14.6 

20 313636 AA262397 Hs.201366 ESTs 145.2 14.5 

302437 AB024729 Hs.227473 UDP-N-acetylglucosamineia-I.S-D^nannosid 145 14.5 

318197 AM73096 Hs.133403 ESTs 144.8 145 

302749 M16951 gb:Human Ig mu-chaln mRNA VDJ4-region, 5 144.6 145 

322357 AI734258 Hs.245367 ESTs, Weakly similar to ALU1JHUMAN ALU S 144.614.5 

25 300391 AI927371 Hs.288839 hypothetical protein FU 121 78 144.4 14.4 

326077 predicted exon 144.4 14.4 

302004 Y18264 Hs. 123094 sal (Drosophi!a)-Kke 1 ' 144 14.4 

320668 AA805666 Hs.146217 Homo sapiens cDNA: FU23077 fis, clone L 144 14.4 

331212 T88693 Hs.226410 ESTs 144 ' 14.4 

30 311268 AI969727 Hs.231859 ESTs 143.2 14.3 

305159 AA659166 Hs.275668 EST,WeaklysimilartoEF1D_HUMANELONGATIONF 143 14.3 

304510 AA457391 Hs.119122 ribosomalproteinL13a 142.8 14.3 

320852 AA772920 Hs.303527 ESTs 142.8 14.3 

330854 AW291944 Hs.122139 ESTs 142.8 14.3 

35 318275 AW449952 Hs.190125 basfc^eBx-toop^erix-PAS protein 142.6 14.3 

314992 A1824879 Hs.211286 ESTs, Weakly similar to 1207289A reverse 1422 14.2 

322631 AA001697 Hs.293565 ESTs, Weakly similar to putative p1 50 [H 142.2 14.2 . 

332283 R40855 Hs.100839 EST 142 14.2 

302894 AA719572 Hs.274441 Homo sapiens mRNA; cDNA DKFZp434N011 (fr 141.214.1 

40 301808 R35391 Hs.252831 reticulon 3 141 14.1 

318608 AI204491 Hs.151502 ESTs 141 14.1 

316499 AW292947 Hs.122872 ESTs 140.8 14.1 

317011 AI248760 Hs.150276 ESTs 140.8 14.1 

321840 N45600 Hs.46534 : Homo sapiens mRNA; cDNA DKFZp434P0714 (f 140.814.1 

45 327365 predicted exon 140.8 14.1 

331264 AA278898 Hs.225979 hypothetical protein similar to small G 140.8 14.1 

324545 AW501944 Hs. 127243 Homo sapiens mRNA for KIAA1724 protein, 140.4 14 

312986 AA211586 gb:zn56d05.s1 Stratagene musde 937209 H 140.2 14 

316053 AA825814 Hs.149065 ESTs 140.2 14 

50 330723 BE247449 Hs.31082 hypothetical protein FU10525 140.2 14 

304876 AA595765 gb:nj28g06.s1 NCLCGAPJW Homo sapiens 139.814 

311379 AW134766 Hs.202450 ESTs 139.8 14 

318265 AW019873 Hs.146840 ESTs 139.8 14 

324137 AA393127 Hs.222762 ESTs 139.8 14 

55 328262 predicted exon 139.6 14 

322349 AK001279 Hs.180171 Homo sapiens cONA FU10417 fis, clone NT 139.4 13.9 

323504 AA280223 Hs.130865 ESTs 139.4 13.9 

304261 AA059387 gbzf66d01.s1 Soares retina N2MHR Homo 139.2 13.9 

310489 AW451493 Hs.235516 hypothetical protein PR02955 139J2 13.9 

60 335946 predicted exon 139.2 13.9 

318155 AI041546 Hs.132133 ESTs 138.8 13.9 

313796 AI797169 Hs.208486 ESTs 138.6 13.9 

333977 predicted exon 138.6 13.9 

324845 AW969635 Hs.283718 ESTs 138.2 13.8 

65 331139 R65706 gb.7H6gi2.sl Soares placenta Nb2HP Homo 138.2 13.8 

331131 R54797 gb:yg87b07,s1 Soares infant brain 1NIB H 669.6 13.8 

321250 H58539 Hs.151692 ESTs 138 13.8 

312498 AA668782 Hs.191284 ESTs, Weakly similar to ALU 1JWMAN ALU S 137.813.8 

331252 W52470 Hs.34578 alpha2,3-sia!yltrai*sferase 137.8 13.8 

70 337407 predicted exon 137.8 13.8 

303973 AW512014 gbxc68a03 jc1 NCLCGAPJ.ym12 Homo sapfen 137.4 13.7 

314582 AA412258 Hs.188817 ESTs 137.4 13.7 

327373 predicted exon 137.2 13.7 

323367 AA234591 Hs.304123 ESTs 136.6 13.7 

75 316207 AA832065 Hs.120260 ESTs • 136.4 13.6 

315231 AA705809 Hs.119922 ESTs 136.2 13.6 
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318592 T39310 Hs.1139 coW stock domain protein A 136.2 13.6 

320906 AW969706 Hs.293332 ESTs 136.2 135 

328937 predicted exon 136.2 13.6 

329073 predicted exon 136.2 13.6 

5 318231 AV659082 Hs.134228 ESTs 136 13.6 

311992 AL360200 Hs.114145 ESTs 135.8 13.6 

316497 AA766457 Hs.136849 ESTs 135.8 13.6 

317677 AA968594 Hs.127868 ESTs 135.8 13.6 

321680 WQ2848 Hs.93704 ESTs 135.8 13.6 

10 326080 predicted exon 135.8 13.6 

330938 AF036943 Hs.172619 KIAA1 106 protein 135.8 13.6 

306573 AL134878 Hs.1 19500 ribosomal protein, large P2 135.6 13.6 

307383 A1223207 Hs.147888 EST ■ 135.6 13.6 

311114 AW449382 Hs.195297 ESTs 135.6 13.6 

15 320579 R1513B Hs.165570 Homo sapiens clone 25052 mRNA sequence 135 135 

301328 AA884104 Hs.125546 ESTs 134.8 135 

312063 N58198 Hs.182898 ESTs 134.8 135 

323036 H09604 Hs.13268 ESTs 134.6 135 

332776 AF241850 Hs.151428 ret finger protein 2 134.4 13.4 

20 332494 AA282330 Hs.145668 ESTs 134.2 13.4 

334376 predicted exon 134.2 13.4 

313264 N93416 Hs.118228 ESTs 133.6 13.4 

313669 AA351109 Hs.5437 Taxi (human T-ceil leukemia virus type I 133.2 133 

312083 T87398 Hs.205816 ESTs 132.6 133 

25 319354 AA993807 Hs.167367 ESTs . 132.6 133 

307414 AI242106 gb:qh92a02.x1 SoaresJIFUJ_GBC_S1 Homo s 132.2 13.2 

312771 AA018515 Hs.264482 Apg12 (autophagy 12, S. cerevisiae)-like 131.8 13.2 

313004 AI274963 Hs.145900 ESTs 131.2 13.1 

300995 AW510641 Hs.258018 ESTs 220.6 13 

30 319323 F12650 Hs.13287 ESTs 125.4 125 

329451 predicted exon 123.4 123 

337603 predicted exon 572 12.2 

312480 R68651 Hs.144997 ESTs 121.4 12.1 

324934 AW452051 Hs.147546 ESTs 119.4 115 

35 320723 BE178025 Hs.7942 hypothetical protein FLJ20080 117 11.7 

318188 AI792566 gb:qi74f02.y5 NCI_CGAP_Ov26 Homo sapiens 116.611.7 

320873 AF238869 Hs.283955 Homo sapiens done GLSH-2 similar to gli 112.8 11.3 

331005 BE003191 Hs.119555 ESTs 112.6 11.3 

304969 AA614406 gb:np46f05.s1 NCl_CGAP_Br11 Homo sapiens 112.411.2 

40 319799 AI139253 Hs.227767 zinc finger protein 41 111.2 11.1 

302610 AA347945 Hs.256024 ESTs 111 11.1 

309485 AW130320 Hs.108124 ribosoma!protelnS4 r X4inked 111 11.1 

311880 AW419225 Hs.256247 ESTs 110.2 11 

313981 AW452334 Hs.128148 ESTs 110.2 11 

45 322442 W49701 Hs.29667 ESTs 109.4 10.9 

315099 AA806536 Hs.291841 ESTs 109 105 

304793 AA583264 Hs.182979 ribosomaiproteinL12 108.8 105 

330815 AA019211 Hs.236463 KiAA1 238 protein 108.8 10.9 

304044 T81656 Hs.252259 ribosomal protein S3 714.8 10.8 

50 325222 predicted exon 135 10.8 

325889 . predicted exon 814.6 103 

321447 AW891130 Hs.38173 ESTs 107.8 10.8 

302990 AA496212 Hs.180182 ESTs 106.2 10.6 

308108 AI476803 gb:tj77e12j(1 SoaresJ^SFJ : 8JWjDTJ>A_P_S 270.610.6 

55 310536 AI301041 Hs.150174 ESTs 106 10.6 

315257 AW157431 Hs.248941 ESTs 233 10.6 

318787 Z42313 Hs.22657 ESTs 105.8 10.6 

312306 AI927226 Hs.175610 ESTs 105.2 10.5 ' 

326788 predicted exon 104.4 10.4 

60 312234 AA830640 Hs.206934 ESTs 104 10.4 

314482 AW085525 Hs.134182 ESTs 234 10.4 

323597 A1185693 Hs.135119 ESTs 102.4 10.2 

302623 AW836724 Hs.1941 10 hypothetical protesn PRO2730 162.4 10.2 

323594 AI791531 Hs.129993 ESTs 101 10.1 

65 324315 N55761 Hs.194718 zinc finger protein 265 100.2 10 

314217 AA256465 Hs.188725 ESTs 99.2 9.9 

320932 AA554913 Hs.162297 ESTs 98.2 9.8 

327876 predicted exon 98.2 9.8 

319736 R17424 Hs.6650 vacuolar protein sorting 45B (yeast homo 98 9.8 

70 327747 predicted exon ' 97.6 9.8 

327844 predicted exon . 97.4 9.7 

318200 AI061192 Hs.166517 ESTs 97.2 9.7 

329414 predicted exon 97.2 9.7 

318296 AI089667 Hs.270713 ESTs 121.4 .9.7 

75 307010 AI140014 gb:qa68fD9j(1 SoaresJetaLhearLNbHH19W295 9.7 

319792 AI138635 Hs.22968 ESTs 385.4 9.6 
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305571 AA811688 Hs.82113 dUTPpyrophosphatase 96 9.6 

329440 predicted exon 93.8 9.4 

310381 AI263059 Hs.145594 ESTs 93.4 9.3 

318824 F06771 Hs.27226 ESTs 93.4 9.3 

5 328957 predicted exon 92.2 9.2 

318804 242549 Hs.160893 ESTs 92 9.2 

330836 AA055611 Hs.226568 ESTs, Moderately similar to ALU4_HUMAN A 92 9.2 

324592 AW752437 Hs.325708 ESTs 91.8 9.2 

311B20 AW274545 Hs.254333 ESTs 91.4 9.1 

10 321614 H86161 gb:ys94b01.r1 Soares retina N2b5HR Homo 91 9.1 

330306 predicted exon 91 9.1 

303096 AL080276 Hs.268562 regulator of G-protein signalfing 17 90 9 

313275 AI027604 Hs.159650 ESTs 110.4 8.8 

302593 H54855 Hs.36958 ESTs 88 8.8 

15 321421 BE465115 Hs.171688 ESTs 86.2 8.6 

330832 AI133530 Hs.62930 ESTs 456.4 8.6 

311847 AW301807 Hs.297260 ESTs 86 8.6 

322036 BE002723 Hs.301905 Homo sapiens cDNA RJ14080 fis, clone HE 145.8 8.6 

328688 predicted exon 85.6 . 8.6 

20 325251 predicted exon 85.4. 8.5 

329088 predicted exon 85.4 8.5 

322524 W79027 Hs.271762 ESTs 84 8.4 

337953 predicted exon 451 8.3 

323529 AA284397 Hs.201485 Homo sapiens clone FLC0664 PR02866 mRNA, 82.6 8.3 

25 307041 AI144243 gb:qb85b12j(1 Soares JetalhearLNbHH19W 306.88.2 

318285 AI332454 Hs.158412 ESTs 81.4 8.1 

312021 AA759263 Hs.14041 ESTs 81 8.1 

329350 predicted exon 81 8.1 

326169 predicted exon 80.4 8 

30 338038 predicted exon 10245 7.9 

312549 A1214510 Hs.146304 ESTs 77.4 7.7 

312542 D60076 gb:HUM084E10AClontech human fetal brain 76.8 7.7 

320992 AB026891 Hs.225972 solute carrier family 7, (cationic amino 76 7.6 

318596 AI470235 Hs.172698 EST 150.6 7.5 

35 315650 AA649042 Hs.269615 ESTs 73.4 7.3 

324328 AA447276 Hs.292020 ESTs 210.4 7.1 

332622 R10674 Hs.128856 CSR1 protein 702 7 

328229 predicted exon 69.4 6.9 

319110 T75260 Hs.98321 hypothetical protein FU14103 68.6 6.9 

40 316133 AI187742 Hs.125562 ESTs 308.6 6.9 

303992 AW515800. gb:hdB8g01 jc1 NCI_CGAP_GC6 Homo sapiens 67.8 6.8 

322675 AA017656 Hs.146580 endase 2, (gamma, neuronal) 377.2 6.7 

325753 predicted exon 105.2 6.6 

312539 AI004377 Hs.200360 Homo sapiens cDNA FU13027 fis, clone NT 92.2 6.4 

45 302592 AA294921 Hs.250811 v-ral simian leukemia viral oncogene horn 361.6 6.3 

314578 AA410183 Hs.137475 ESTs 201.6 6.1 

335966 predicted exon 108.6 6 

321478 AW402593 Hs,123253 hypothetical protein FU22009 528 . 6 

305192 AA666019 gb:ag44a04.s1 Jia bone marrow stroma Horn 58.6 5.9 

50 304275 AA070605 gb:zm53h09.s1 Stratagene fibroblast (937 78.6 5.6 

302779 AJ235667 gb:Homo sapiens mRNA for immunoglobulin 278.8 5.5 

301976 T97905 Hs.77256 enhancer of zeste (Drosophila) homotog 2 4795 5.4 

316021 AW293399 Hs.144904 nuclear receptor co-repressor 1 792.4 5.3 

320802 BE336699 Hs.185055 BENE protein 2423.8 5.3 

55 317282 AI733112 Hs.176101 ESTs 523.2 5.1 

316827 AI380429 Hs.172445 ESTs 578 5.1 

303190 BE280787 Hs.16079 hypothetical protein FU 10233 223 5.1 

315587 A126B399 Hs.140489 ESTs 136.2 5 

333122 predicted exon 399 5 . 

60 310214 AI220072 Hs.165893 ESTs 234.4 4.9 . 

320069 D43945 Hs.1 13274 transcription factor EC 68 4.9 

309328 AW024348 Hs.233191 EST, WeaWy similar to A27217 glucose tr 258.8 4.8 

318971 Z44067 Hs.10957 ESTs 376.6 4.8 

327220 predicted exon 47.4 4.7 

65 315757 AW014605 Hs.179872 ESTs 177.4 4.7 

320730 R68869 Hs.151072 ESTs 205.2 4.6 

313339 AI682536 Hs.163495 Homo sapiens cDNA FU1 3608 fis, clone PL 260 4.5 

318634 T49598 Hs.156832 ESTs 475.2 4.5 

320955 AW620035 Hs.278679 a disintegrin and metatloproteinase dorna 388.6 4.4 

70 306605 AI000497 Hs.119500 ribosomalprotein J largeP2 ,81.6 4.4 

309349 AW051913 gbwx24a09.x1 NCLCGAP_Kki11 Homosapien 102.44.3 

306004 AA889992 Hs.2186 eukaryotictranslationelongationfactorlga 451.2 4.2 

330020 predicted exon 61.2 4.1 

302308 AW327279 Hs.91379 ribosoma! protein L26 342 3.9 

75 314648 AW979268 gb:EST391378 MAGE resequences, MAGP Homo 56.4 3.8 

315131 AI753709 Hs.152484 ESTs 130.4 3.7 
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313690 A1493591 Hs.78146 plateletfendothefial cell adhesion molec 3179.6 3.6 

333585 predicted exon 175.4 3.5 

312911 H93366 Hs.7567 Homo sapiens cDNA: FU21962 fis, done H 219 3.5 

322966 AA633669 Hs.235920 Homo sapiens cell recognition molecule C 350.2 3.4 

5 312492 R71072 Hs.191269 ESTs 322.8 3 

318988 Z44203 Hs.26418 ESTs 25 2.5 

332363 AI123705 Hs.106932 ESTs 773.4 2.5 

324181 AI025476 Hs.131628 ESTs 634.8 2.4 

311717 AW205369 Hs.312830 ESTs 54.2 2.4 

10 321342 AA127984 Hs.222024 transcription factor BMAL2 23.4 2.3 

308852 A1829848 Hs.162937 peptldylprolylisomeraseA(cyclophillnA) 92 2.3 

331466 AA373210 Hs.43047 Homo sapiens cDNA FU 13585 fis, clone PL 494 2.3 

320279 AB033062 Hs.134970 DKFZP434N1 78 protein 76.2 2.2 

322221 N24235 Hs. 1 79662 nudeosome assembly protein 1 -like 1 253.2 2.1 . 

15 302925 AL137449 Hs.126666 homeoboxB4 136.6 2.1 

331384 AB041035 Hs.93847 NADPH oxidase 4 720 1.8 

* 300938 AA514416 Hs.152320 ESTs, Weakly similar to 1605244A erythro 27 1.8 

312695 AW196663 Hs.200242 ESTs 303.8 1.6 

320223 W35132 Hs.267442 ESTs 189 1.5 

20 ' 332743 AW247977 Hs.87595 translocase of Inner mitochondrial membr 14.4 1.4 

331039 AW378685 Hs.18625 Mitochondrial Acyl-CoA Thioesterase 529.8 1.4 

333123 predicted exon 396.2 1.4 

328455 predicted exon 91.8 1.3 

334458 predicted exon 406.4 1.3 

25 313478 AA643008 Hs.192775 ESTs 413.4 1.1 

309899 AW33B564 Hs.217493 annexinA2 -30.8 1 

311735 AW294416 Hs.144687 Homo sapiens CDNARJ12981 fis, clone NT -62.8 ' 1 

312953 NM_001992Hs.128087 coagulation factor II (thrombin) recepto -73.6 1 

313055 AW367295 Hs.241175 ESTs -43.8 1 

30 313291 AI267970 Hs.150614 ESTs, Weakly similar to ALU4JWMAN ALUS -63 1 

315059 AW275110 Hs.271106 ESTs -67 1 

322284 AI792140 Hs.49265 ESTs -395.2 1 

322450 AL121278 Hs.25144 ESTs -1.6 1 

324803 AW975183 Hs.292663 ESTs 4.4 1 

35 331495 AW970939 Hs.291039 ESTs -282.8 1 

333610 predicted exon -152.6 1 

335093 predicted exon -23.2 1 

339403 predicted exon -331.2 1 

302820 X04588 Hs.85844 neurotrophic tyrosine kinase, receptor, 591.2 1 

40 302270 R56151 Hs.93589 Homo sapiens mRNA; cDNA DKFZp564B1 162 (f 276.61 

323755 AW300094 Hs.136252 ESTs 135 0.9 

326946 predicted exon 727.4 0,9 

315343 BE144306 Hs.179891 ESTs, Weakly similar to P4HAJHUMAN PROLY 122.80.9 

311168 AK001270 H&196086 hypothetical protein HJ10408 304 0.9 

45 329732 predicted exon 109.2 0.9 

321415 BE621807 Hs.3337 transmembrane 4 superfamily member 1 414.8 0.7 

333121 predicted exon 87.8 0.7 

333120 predicted exon 379.8 0.7 

330392 AW797956 Hs.75748 proteasome (prosome, macropain) subunit, 589.2 . 0.7 

50 314711 AA769365 Hs.126058 ESTs -87 0.6 

330865 BE409857 Hs.69499 hypothetical protein 347.4 0.6 

333169 predicted exon - -1182 0.6 

335095 predicted exon 106.4 0.6 

335815 predicted exon -156 0.6 

55 330232 predicted exon 102.6 0.6 

330823 AA031565 Hs.221255 ESTs, Moderately similar to ALU5_HUMAN A -62 0.5 

331704 F04225 . Hs.66032 ESTs -14.6 0.5 

302642 NMJ16428HS.130719 NESH protein 267.6 0.5 

304484 AA432067 Hs.258373 ESTs 85 0.5 

60 310230 AKOO0377 Hs.144840 homolog of mouse C2PA -70 0.4 

301531 AI077462 Hs.134084 ESTs -195.4 0.4 

306337 AA954221 Hs.73742 ribosomalprotein,large,P0 -33.4 0.4 

331327 N46436 Hs.109221 ESTs -392 0.4 

332961 predicted exon -5.6 0.4 

65 322796 W31178 Hs.154140 Homo sapiens ovary-spedfic acidic prote -880.6 0.3 

328857 predicted exon 55.2 0.3 

316342 AA743935 Hs.202329 ESTs 43.4 0.3 

331263 AW780192 Hs.267596 ESTs -180.4 0.3 

335987 predicted exon -134 0.3 

70 311923 T60843 Hs.189679 ESTs 12.2 0.3 

310522 AW134529 Hs.244647 ESTs -187.8 0.3 

315363 AA759190 Hs.121454 ESTs, Weakly similar to olfactory recept 80 0.3 

302032 NNL001992HS.128087 coagulation factor II (thrombin) recepto -877 0.3 
313140 BE265133 Hs.217493 annextnA2 95.4 0.3 

75 310860 AW015920 Hs.161359 ESTs -239 0.3 

317899 AI952430 Hs.150614 ESTs, Weakly similar to ALU4JWMAN ALU S -715.2 0.3 
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328520 


predicted exon 


-109.2 


02 


302406 


NM_012099Hs.211956 CD3-epsilon -associated protein; antisens 


10 


0.2 


311804 


AI866921 Hs.203349 Homo sapiens cDNA FU12149 fls, clone MA -252.6 


0.2 


315065 


AK001122 Hs.105859 hypothetical protein FU10260 


-46.2 


0.2 


314129 


AA228366 Hs,1 15122 ESTs 


-308.8 


0.2 


335697 


predicted exon 


-47.2 


0.2 


335989 


predicted exon 


89 


0.2 


320606 


AW867943 Hs.127216 hypothetical protein FU 13465 


-205.6 


0.2 


329745 


predicted exon 


103 


0.2 


313628 


AW419069 Hs.209670 ESTs 


-177.8 


0.2 


334616 


predicted exon 


-936.6 


0.2 


308820 


AI821267 Hs.207243 EST 


-7.2 


0.2 


320416 


AI026984 Hs.293662 ESTs 


-18.4 


0.2 


335211 


predicted exon 


-142 


0.2 


323629 


AA375957 Hs.6682 ESTs 


-100 


0.1 


331420 


AW452904 gb:UI-H-Bi3-aly-h-11-0-Ul.s1 NCLCGAP.Su 83 


0.1 


315984 


AI015862 Hs.131793 ESTs 


-250.6 


0.1 


332833 


predicted exon 


-374.2 


0.1 


332607 


NM_002314Hs,36566 LIM domain kinase 1 


-27.6 


0.1 


313467 


AA004879 Hs.187820 ESTs . 


-288.2 


0.1 


323333 


AV651680 Hs.208558 ESTs 


-735.6 


0.1 


330775 


AW247020 Hs.250747 SUMO-1 activating enzyme subunit 1 


53.6 


0.1 


333168 


predicted exon 


-1041.8 


0.1 


332079 


AI308876 Hs.103849 ESTs 


19.4 


0.1 


322724 


AF161442 Hs.191591 Homo sapiens HSPC324 mRNA, partial cds 


-123.6 


0.1 


303652 


AI799111 Hs.64341 ESTs 


46.4 


0.1 


303131 


AW081081 Hs.103180 DC2 protein 


-156.4 


0.1 


320716 


A1479439 Hs.171532 ESTs 


-146.6 


0.1 


300454 


AA659037 Hs.163780 ESTs 


-304 


0.1 


312757 


AI285970 Hs.183817 ESTs 


-445. 


0.1 


312391 


R43707 Hs.133159 ESTs, Weakly similar to PIHUSD salivary 


-111.8 


0.1 


308877 


AI832519 gb:at69h03 x\ Barstead colon HPLRB7 Homo -149.6 


0 


311275 


AI659166 Hs.207144 ESTs 


-62.6 


0 


302363 


AW163799 Hs.198365 2,3-bisphosphoglycerate mutase 


-15 


0 


321717 


AW956580 Hs.42699 ESTs 


-1059.6 


0 


302638 


AA463798 Hs.102696 MCT-1 protein 


-332.2 


0 


306352 


AA961 367 gb:or52a05.s1 NCI_.CGAP.GC3 Homo sapiens 


21.8 0 


313798 


AI292148 Hs.71622 SWI/SNF related, matrix associated, actl 


-97.2 


0 


320807 


AA135370 Hs.188536 Homo sapiens cONA: FU21635 fis, done C 


-7m 


0 


320931 


AW262836 Hs.252844 ESTs 


-681.6 


0 


332450 


AW288085 Hs.11156 hypothetical protein 


28.4 


0 


332535 


AF1 67706 Hs. 19280 cysteine-rich motor neuron 1 


-722 


0 


335990 


predicted exon 


-421 


0 


330746 


AB0338B8 Hs.8619 SRY (sex determining region Y)-box 18 


35.4 


0 


316820 


AI627912 Hs.130783 Forssman synthetase 


-373.6 


0 


337429 


predicted exon 


-257 


0 


331192 


BE622021 Hs.152571 ESTs, Highly similar to IGF-li mRNA-bind 


-33 


0 


330609 


AI346201 Hs.76118 ubiquitin carboxyl-terminal esterase L1 


-280 


0 


323593 


AI739435 Hs.39168 ESTs 


-3627.6 


0 


302704 


AA531133 Hs.4253 hypothetical protein MGC2574 


-278.6 


0 


330534 


NM_004579Hs.82979 mrtogen-activating protein kinase kinase 


-244 


0 


332374 


X91 195 Hs.100623 phosphoBpase C, beta 3, neighbor pseudo 


-1204.2 


u 


333221 


predicted exon 


-189.6 


0 


335988 


predicted exon 


-122.6 


0 


330574 


AI984144 Hs.66713 hepatitis delta antigen-interacting prot 


-2257.4 


0 


312052 


BE621697 Hs.14317 nucleolar protein family A, member 3 {HI 


-359.2 


0 


319568 


AF131781 Hs.84753 hypothetical protein RJ 12442 * 


-874.6 


0 


337113 


predicted exon 


-24.6 


0 


335149 


predicted exon 


-191.8 


0 
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TABLE 6A 

Table 6A shows the accession numbers for those pkeys lacking unlgenelD's for Table 6. The pkeys in Table 7 lacking unlgenelD's are represented within 
Tables 1-6A. For each probeset we have fisted the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
ADgnment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the "Accession' 
column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 



320925 1525201 J D62B92 D79755 D62760 

20 321614 87866J H86161 AA054308 AA018955 

313952 136885 J F20956 AA1 29374 AA1 33740 AW81987B 
314648 293660J AW979268 AA878419 AA431342 AA431628 
302749 458 107 M16951 M16952 M1694B M16949 M16950 
312362 764066 J AW015994 R39898 AW000978 AI598202 AI521706 

25 312542 1522649J D60076 D60259 D61037 

312642 1005225.1 AW052128 H51439 H51481 

312986 171879J AA21 1586 F35799AA21 1641 F29720 AW937387 AW937408 
329350 cjLhs 
329414 c_y_hs 
30 329440 c_y_hs 
329451 c_yjis 

338033 CH22J528FG_LINK_EMAC00 

338038 CH2ZJ535FG_JJNKJMAC00 

338116 CH22J650FG_LINK_EMACOO 
35 338158 CH2£J700FG_JJNK_EMACO0 

329732 c14j>2 

329745 c14_p2 

308106 AI476803 

329863 c14j>2 
40 338316 CH22J944FG__UNKJMAC0O 

308248 AI560919 

338388 CH22J034FG.JJNKJMACQ0 
338442 CH22J109FG.J-INKJMACQO 
338645 CH22_7410FG_LINK.EM:ACOO 
45 338728 CH22L_7527FG__LINK_EMAG00 
308877 AI832519 

338962 CH22J838FG__UNK_DJ32I10 
308886 AI833240 

333120 CH22_349FGJ1J_.LINK.EMA 
50 333121 CH22L350FGJ1.4.LINK^MA 

333122 CH22J51FGJ1JJJNieEMA 

333123 CH22_352FGJ1_7_UNieEMA 

333168 CH22L400FGJ4J_UNK - EMA 

333169 CH22L.401FGJ4JJ.INKJEMA 
55 333221 CH22_458FGJ05_UINK_EM: 

326077 c17_hs 

326080 c17_hs 

326169 c17_hs 

326198 c17_hs 
60 326230 c17_hs 

333585 CH22J46FGJ03JJJNKJM: 

333610 CH22.871FGJ17J.UNK.EhA: 

335093 CH22J423FGJ92JJJNKJM 

335095 CH22J425FG.492JJJNKJM 
65 335149 CH22J484FGJ99JJJNKJM 

326759 c20_hs 

333977 CH22.1254FG 309 6J.INKJM 
326788 c20Jis 

335211 CH2Z.2550FG 511J.L1NK.EM 
70 305192 AA666019 

303973 AW512014 

303992 AW515800 

326946 c21_hs 

328229 cJJs 
75 328262 CJJs 
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328418 c_7_hs 
328455 c 7 Jis 

335697 CH22_3058F6J96JUINK_E 
328520 c_7_hs 
5 328548 cj_hs 

335815 CH22J187FG_618J_UNK_EM 
328688 cJLhs 
328695 cj.hs 
307010 AI140014 
10 337113 CH22J058FG.493J. 
307041 AI144243 
328700 c_7_hs 

335946 CH22_3324FGJ46 -J 20_LINfU> 
335986 CH22 3366FG.654J0JJNKJ) 
15 335987 CH2^3367FG_654J1JJNKJ> 

335988 CH22J^FGJ54.12_UNK_D 

335989 CH2a.3369FG_655XUNK.DJ 

335990 CH22J37QFG.655_4_LINK_DJ 
337214 CH22J288FGJ13JL 

20 330020 c16_p2 

305989 AA888220 

328857 c 7_hs 

328937 cj.hs 

328957 cj.hs 
25 330187 c_4_p2 

337407 CH22_5607FG_755_1_ 

337429 CH2a.5633FG_762.3_ 

330232 Cj_p2 

307414 AI242106 
30 330305 c_7_p2 

330306 c_7_p2 

337603 CH22J896FG_LINKJ20H12. 
337953 CH22_6395FG__LINK_EMAC00 
339236 CH22 8181FG__LINKJA354I1 
35 339403 CHZL8384FGJLINK.BA232E1 
309349 AW051913 
325222 c10_hs 
325251 dOJw 

318188 956161J AI792566 AI053836 AI054127 AI792489 AI288324 
40 309871 AW300366 
325544 c12Jis 
309931 AW341683 

332833 CH22_50FG_17_7_LINK-C20H1 

302779 33837 1 AJ235667 AJ235666 AJ235664 AJ235665 AJ235668 AJ235669 AJ235670 
45 302790 34168~1 AJ245245 AJ245247 AJ245257 AJ245246 AJ245254 AJ245256 AJ245253 AJ245203 AJ245250 AJ245252 AJ245243 AJ245204 
AJ245201 AJ245206 AJ245246 AJ245255 AJ245205 AJ245202 AJ245251 AJ245249 AJ245207 AJ245244 

332961 CH22J85FG_48J8JJNKJM: 

325753 c14_hs 

327036 c21_hs 
50 325843 c16Js 

325889 c16.hs 

304261 AA059387 

304275 AA070605 

334376 CH22_1670FG_379JJJNK_EM 
55 327220 cjjis 

304363 AA206045 

334458 CH2ZJ757FG_391_2_LINK_EM 
327365 cjjis 
327373 cjjis 
60 334616 CH22_1923FG_411J5_LINK_E 
327414 c_2_hs 
327568 c_3_hs 

336034 CH22J419FGJ78_5_LINK_DJ 
336059 CH22J445FGJB4J_UNK_DJ 
65 334834 CH22J2148FG_439_3_LINieEM 
304782 AA582081 
304876 AA595765 
327747 Cj.hs 

336228 CH22_3626FG_730_4_LlNKJ3A 
70 329073 clOs 

329088 c_xJs 

304969 AA614406 

327844 cj.hs 

327876 cJJs 
75 306352 AA961367 

331131 genbanLR54797 R54797 
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331420 675963 J AW452904 AW449414 BE467906 A1298565 BE549932 BE326357 F04362 
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TABLE 6B 

Table 6B shows the genomic positioning for those pkeys lacking unigene ID's and accession numbers in Table 6. The pkeys In Table 7 lacking 
unlgenelD's are represented within Tables 1-6B. For each predicted exon, we have listed the genomic sequence source used for prediction. Nucleotide 
locations of each predicted exon are also listed. 



Pkey: 
Ref: 



Strand: 
NLposition: 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. 'Dunham I. et al.' refers to the publication 
entitled "The DNA 

sequence of human chromosome 22." Dunham I. etal, Nature (1999) 402:489-495. . 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 



Pkey Ref 



332961 
333221 
333585 
333610 
334376 
334458 
334616 
335149 
335211 
335697 
335986 
335987 
335988 
335989 
335990 
336034 
337953 
338033 
338038 
338316 
338442 
338962 
332833 
333120 
333121 
333122 
333123 
333168 
333169 
333977 
334834 
335093 
335095 
335815 
335946 
336059 
336228 
337113 
337214 
337407 
337429 
337603 
338116 
338158 



338645 
338728 
339236 
339403 
325222 
325251 
325544 
325753 
329745 
329732 
329863 
325889 



Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal 
Dunham, I. etal. 
Dunham, l. etal. 
Dunham, I. etal. 
Dunham, i. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. eta). 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. eta). 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, L etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. eta). 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal 
Dunham, L eta). 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham, I. eta). 
Dunham, I. etal. 
Dunham, I. etal. 
6525287 
6682448 
6682452 
6682474 
6065779 
6065783 
6691797 
5867087 



Strand 


NLposition 


Plus 


2521424-2521555 


Plus 


3978070-3978187 


Plus 


6234778-6234894 


Plus 


6547007-6547116 


Plus 


13902218-13902331 


Plus 


14353496-14353572 


Plus 


15176123-15176470 


Plus 


r\A JA«T J A A f\A JMrA^ 

21497441-21497587 


Plus 


r\ A *T*f A f* 4 A t*lA **T~T A r* City 

21774611-21774680 


Pius 


25481456-25481649 


Plus 


27967791-27967852 


Plus 


ft^lft *T -J J J A n-»A-T A A ft A 

27971413-27971481 


Plus 


27977912-27978013 


Plus 


27983788-27983 860 


Plus 


27988532t27988608 


Plus 


29014404-29014590 


Plus 


6827029-6827125 


Plus 


8092128-8092271 


Plus 


8138219-6138392 


Plus 


17089711-17089988 


Plus 


19980640-19980698 


Plus 


ftft^ftjftftft AArOOAAn 

29581892-29582020 


Minus 


1119848-1119705 


Minus 


3307508-3307427 


Minus' " 


3308446-3308358 


Minus 


3309596-3309531 


Minus 


3310817-3310749 


Minus 


3729896-3729788 


Minus 


3730864-3730767 


Minus 


8722928-8722725 


Minus 


17182681-17182535 


Minus 


ft .i ft ft^ft ft^ A4 AA7A4 i 

21297367-21297214 


Minus 




Minus 


26320518-26320421 


Minus 


27487203-27487035 


Minus 


29184079-29183969 


Minus 


30904602-30904497 


Minus 


21233344-21233237 


Minus 


26095902-26095502 


Minus 


31886652-31886567 


Minus 


32086238-32086079 


Minus 


1299296-1299194 


Minus 


10614071-10613814 


Minus 


11794465-11794343 


Minus 


18662403-18662305 


Minus 


24063839-24063775 


Minus 


25949039-25948927 


Minus 


3277335532773202 


Minus 


34050728-34050625 


Minus 


22332-22473 


Minus 


411693-411751 


Plus 


171228-171286 


Plus 


398512-398521 


Pius 


174774-175142 


Plus 


161252-161322 


Plus 


196801-196971 


Plus 


223829-223891 
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325843 6552453 

330020 6671887 

326198 5867215 

326230 5867230 

5 326169 5867255 

326077 6682495 

326080 6682495 

326759 6249610 

326788 6682503 

10 326946 6004446 

327036 6531965 

327220 5867525 

327365 6552412 

327414 5867750 

15 327373 5867792 

327568 5867811 

330187 6706138 

327747 5867947 

327844 6249582 

20 330232 6013526 

328229 5868105 

327876 5868140 

328262 6381906 

328688 5868262 

25 328700 5868264 

328695 5868264 

328418 5868409 

328455 5868431 

328520 5868477 

30 328548 5868487 

328857 6381927 

330305 4877982 

330306 4877982 
328937 5868500 

35 328957 6456773 

329073 5868596 

329088 5868608 

329350 6456785 

329414 5868874 

40 329440 5868885 

329451 5868887 



Minus 


7126-7232 


Plus 


172397-172491 


Minus 


80295-80674 


Minus 


301868-301972 


Minus 


128321-128388 


Minus 


312108-312168 


Plus 


478644-478847 


Plus 


97216-97311 


Plus 


277132-277335 


Minus 


116677-116967 


Plus 


319951-320040 


Minus 


65701-65781 


Minus 


118133-118198 


Plus 


102461-102586 


Minus 


81864742 


Minus 


4615246287 


Plus 


212923-213020 


Plus 


115322-115498 


Minus 


18895-18958 


Plus 


113655-113830 


Minus 


120936-121053 


Plus 


103882-104034 


Plus 


11867-12027 


Plus 


626030-626094 


Plus 


764089-764203 


Plus 


318632-318695 


Minus 


258811-258894 


Plus 


38557*385633 


Plus 


1942075-1942246 


Plus 


72301-72397 


Minus 


80557-81051 


Minus 


52269-52365 


Plus 


96161-96233 


Minus 


1448241-1448333 


Plus 


219195-219297 


Plus 


3783847956 


Plus 


116738-116950 


Plus 


98911-98969 


Plus 


942555-942643 


Plus 


21943-22063 


Plus' 


25974-26048 
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TABLE 7: 

Table 7 depicts Seq ID No., UnlgenelD, UnigeneTitle, Pkey, and ExAccn for at! of the sequences In Table 8. Seq ID No links the nucleic acid and protein 
sequence information inTabte 8 to Table 7. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unigene number 

Unigene Title: Unigene gene title 

Seq.lD.No.: Sequence Identification Number found In Table 8 



PKey 


ExAccn 


Unigene ID 


Unigene Time 


ccn |n mo 
otU IU WU 


101545 


BE246154 


Hs. 154210 


endothelial oinerenuation, spnmgoiipi 




115819 


AAhooozU 


ns.411oo 


endomucin-2 


OcL| IU Jut 


424503 


Ml j nrnonc 

NMJJU22UD 


HS.1490U9 


integrin, alpha 5 (fibronectin receptor, 




102917 


AJQ1o712 


nS.zo7 f\ii 


integrin, beta 1 (fibronectin receptor, 




102915 


X07820 


nS.zz5o 


matrix metalloproteinase iu (stromeiysin 




105330 


AW338625 


HS.2212U 


rr»T« 

coTS 


Can in 11 A 19 


107385 


NM.005397 


Hs.16426 


podocalyxin-like 


Can in I 1 ? £. 1A 
o€C| lu 10 fit I't 


102024 


A A Ort 4 OCT 

AA301867 


nS.7bzz4 


cGr-containing nounn-iiKe extracenuia 


Can ID 15 A 16 
oBLj IU 1 J a IU 


102024 


AA301867 


HS.76224 


EGF-containing fibutin-like extracellula 


Qan in 17 A 1R 


134416 


X68264 


Hs.211579 


melanoma cell adhesion molecule 


Qon in 1 o a 9n 


103036 


M13509 


Hs.83169 


matnx metalloproteinase 1 (interstitial 


Qon in 91 A 99 

oeq iu £. i & u 


104865 


T79340 


Hs.22575 


B-cell CLL/lymphoma 6, member B (zinc fi 


Con in 9^ A 9A 


106124 


H93366 


Hs.7567 


Homo sapiens cDNA: r Uzlboz tis, clone ri 


Con in 9*\ A 9fi 
On] IU ZO a 


109001 


AI056548. 


Hs72116 


hypothetical protein rUzuyyz similar to 


Qon in 97 £ 9A 
acq IU H a ZO 


104764 


a mono Jo 

A1039243 


II. 07QCOC 

Hs.z7o5oo 


boTS 


Qon in 9Q A ^0 


4 11 AAA 

133200 


AB037715 


u- 4oocor\ 
HS.1o3oo9 


Hypothetical protein runuzi u 


Qon in "\\ A ^9 
esq iu j I a ql 


105263 


a i a ton nAoo 

AW388633 


|_]_ nnnn 

ris.oooz 


solute carrier family 7, (cationic amino 


Qon in Z 
ofcjq IU OO a OH 


102892 


nr*J iftnin 

BE440042 


rls.ooozo 


matrix metalloproteinase 3 (stromelysln 


Qon in 9^ A 
ocL| IU OO Ot OO 


109456 


AW956580 


li_ lA^nn 

HS.42699 


cSTs 


Qon in ^7 A Ifi 
Oct] IU Or a OO 


1 10906 


A A AA r 1 4 4 

AA035211 


Hs.17404 


ESTs 


Qon in I A(\ 

oeq iu oy ssu 


1 19073 


BE245360 


Hs.279477 


ESTs 


Qon in Ai Z AO 
oeq iu *f i & **z 


132050 


AI267615 


Hs.38022 


ESTs 


Qon in 4^ X 44 
o8t] IU HO a HH 


132490 


NWL001290 


Hs.4980 


LIM domain binding 2 


oeq \U HO Ol 40 


102283 


AW161552 


Hs.83381 


guanine nucleotide binding protein 1 1 


Qon in 47 SL 4ft 

oeq luHf o**o 


101714 


M68874 


Hs.211587 


phosphoiipase A2, group IVA (cytosolic, 


beq iu 4y & ou 


133975 


C18356 


Hs.295944 


tissue factor pathway Inhibitor 2 


Qon in *;1 SL *n 
oeq iu oi & o*i 


106793 


H94997 


Hs.16450 


ESTs 


5eQ ID 00 a 04 


118511 


N75620 


Hs.43157 


ESTs 


Qon in R4. Z 

oeq iu oh a oo 


101447 


M21305 




gb:Human alpha satellite ana satellite d 


Qon in *\fi A 57 
acq iu oo a oi 


314941 


a a r4f n AA 

AA5 15902 


Hs. 130650 


ESTs 


Qon in ftft £ <;q 
oeq iu oo & 09 


332466 


AB018259 


Hs.1 18140 


KIAA0716 gene product 


Cam in cn 9. R4 
oeq IU OU a 01 


313513 


AW298600 


Hs. 141 840 


ESTs, Weakly similar to S59501 interfere 


Qon in ft9 Z fi^ 

oeq iu oz & oo 


313556 


AA628517 


Hs.1 18502 


ESTs 


Qon in fiA Z fi5 

oeq iu oh a oo 


313665 


At i ft T* 4 AAil 

AW751201 


Hs.51233 


ESTs 


Qon in RR Z R7 

oeq iu oo & o/ 


314372 


AL040178 


Hs.1 42003 


ESTs 


Cm in CQ S CO 

beq iu oo & by 


429276 


a ^Ar^AA r* 

AF056085 


1 1. d AAAJA 

Hs.1 9861 2 


G protein-coupled receptor 51 


Con in 7H fi. 71 

beq iu au & n 


101345 


NMJ305795 


Hs.1 52175 


calcitonin receptor-like 


' Con in 79 fi. 71 

beq IU /Z a rO 


418994 


A A AAA^AA 

AA296520 


Hs.89546 . 


selectin E (endothelial adhesion molecul 


Can in 7A fi. 7R 

beq iu fH & /o 


1UO0OU 


AA1R7101 
rv\ lot iw i 


Uc 91TIQ4 
nox IO IJ7H 


hvnnthpHnal nmfftin MRC10895 


Seq ID 76 & 77 


133260 


AA403045 


Hs.6906 


Homo sapiens cDNA: FLJ23197 fis, clone R 


Seq ID 78 & 79 


101097 


BE245301 


Hs.89414 


chemokine (C-X-C motif), receptor 4 (fus 


Seq ID 80 & 81 


104786 


AA027167 


Hs.1 0031 


KIAA0955 protein 


Seq ID 82 & 83 


132173 


X89426 


Hs.41716 


endothelial cell-specific molecule 1 


SeqlD84&85 


100420 


D86983 


Hs.1 18893 


Melanoma associated gene 


Seq ID S6&87 


111018 


AI287912 


Hs.3628 


mitogen-activated protein kinase kinase 


Seq ID 88 & 89 


108507 


AI554545 


Hs.68301 


ESTs 


Seq ID 90 & 91 


104894 


AF065214 


Hs.18858 


phosphoiipase A2, group IVC (cytosolic, 


Seq ID 92 & 93 


118511 


N75620 


Hs.43157 


ESTs 


Seq ID94&95 


125609 


AA868063 


Hs.104576 


carbohydrate (keratan sulfate Gal-6) sul 


Seq ID 96 & 97 


101543 


M31166 


Hs.2050 


pentaxin-related gene, rapidly induced b 


SeqlD98&99 


102241 


NM.007351 


Hs.268107 


muitimerin 


Seq ID 100 & 101 


101560 


AW958272 


Hs.347326 


intercellular adhesion molecule 2 


Seq ID 102 & 103 


103280 


U84722 


Hs.76206 


cadherin 5, type 2, VE-cadherin (vascula 


Seq ID 104 & 105 


105826 


AA478756 


Hs.194477 


E3ubiquitin ligaseSMURF2 


Seq ID 106 & 107 


102804 


NfvL002318 


Hs.83354 


iysyt oxtdase-iike 2 


Seq ID 108 & 109 


131647 


AA359615 


Hs.30089 


ESTs 


Seq ID 110 & 111 


103095 


N1VL005424 


Hs.78824 


tyrosine kinase with immunoglobulin and 


Seq ID 112 & 113 


103037 


BE018302 


Hs.2894 


placental growth factor, vascular endoth 


Seq ID 114 & 115 


100405 


AW291587 


Hs.82733 


nidogen 2 


Seq ID 116 & 117 


102012 


BE259035 


Hs.1 18400 


singed (Drosophila)-Kte (sea urchin fas 


Seq ID 118 & 119 
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101261 


D30857 


Hs.82353 


protein C receptor, endothelial (EPCR) 


C —r. m 4 on p 404 

Seq ID 120 & 121 


105729 


H46612 


Hs.293815 


Homo sapiens HSPC285 mRNA, partial cds 


r>_— tpi 400 D 4O0 

Seq ID 122 & 123 


107216 


D51069 


Hs.211579 


melanoma cell adhesion molecule 


n ir\ a ox p 4oc 
Seq ID 124 & 125 


131080 


NH.001955 


Hs.2271 


endothelin 1 


r»__. |r\ jam n 407 

Seq ID 126 & 127 


131486 


F06972 


Hs.27372 


BMX non-receptor tyrosine kinase 


p __ if\ 4 00 p 4 on 

Seq ID 128 & 129 


134299 


AW580939 


Hs.97199 


complement component C1q receptor 


O _ _ ir\ 4 On 0 414 

Seq ID 130 & 131 


134983 


D28235 


Hs.196384 


prostaglandin-endoperoxide synthase 2 (p 


Ow, m 400 0 400 

seq lu 132 & loo 


115827 


AA428000 


Hs.283072 


actln related protein 2/3 complex, subun 


Cam In 41A D 4QC 

Seq ID 134 & 135 


133614 


NMJ03003 


Hs.75232 


SEC14 (S. cerevislae)-like 1 


Cam in 40C 0 407 

oeq id loo & 137 


116483 


AI346201 


Hs.76118 


ubtquitin carboxyMerminal esterase L1 


Cam in 4O0 P 40ft 

Seq ID 138 6 139 


132546 


M24283 


Hs.168383 


intercellular adhesion molecule 1 (CD54) 


£»_ _ tn 4 Aft P 4AA 

Seq ID 140 & 141 


133678 


AW247252 


NA 


nucleoside phosphorylase 


0_ _ tr\ 440 P 4 A1 

Seq ID 142 & 143 


130184 


H58306 


Hs.15165 


retinolc acid induced 14 


Cm*, in AAA 0 4AC 

Seq ID 144 & 145 


134786 


T29618 


Hs.89640 


TEK tyrosine kinase, endothelial (venous 


r\_-_ 1r\ 4 AC 0 4X7 

Seq ID 146 & 147 


129371 


X06828 


Hs.1 10802 


von Willebrand factor 


r> |f> A AO o a in 

Seq ID 148 & 149 


418506 


AA084246 


Hs.85339 


G protein-coupled receptor 39 


Seq ID 150 & 151 


322262 


AA632012 


Hs.188746 


ESTs 


C***mm I Pi 4 CO 0 4 CO 

Seq ID 152 & 153 


312173 


AI821409 


Hs.304471 


EST 


Seq ID 154 & 155 


319795 


AB037821 


Hs.146858 


protocadherin 10 


ip\ 4CC P 4C7 

Seq ID 156 & 157 


313978 


AI870175 


Hs.13957 


ESTs 


r*_— ir\ 4co o a en 

Seq ID 158 & 159 


306840 


AI077477 


Hs.307912 


ESTs 


n__ in 4cn p 4C4 

Seq ID 16U & lot 


310272 


AF216389 


Hs.148932 


sema domain, transmembrane domain (TM), 


f\„_ ipj 4 CO P 4 CO 

Seq ID 162 & 163 


310272 


AF216389 


Hs.148932 


sema domain, transmembrane domain (TM), 


0__ in 4Cil P 4CC 

Seq ID 164 & 165 


315044 


BE547674 


Hs.204169 


ESTs, Weakly similar to S65657 a!pha-1C- 


C>_ _ tn 4CC a ACT 

Seq ID 1 bo & lt>7 


321325 


AB033100 


Hs.300646 


KIAA1274 protein (similar to mouse palad 


Can in 4CO 0 1CQ 

beq ID loo & 10s 


321325 


AB033100 


Hs.300646 


KIAA1274 protein (similar to mouse palad 


0~_ m 47n 0 474 

Seq ID 170 ft 171 


303251 


AF240635 


Hs.115897 


protocadherin 12 


Pyu, In 470 0 470 

seq ID 172& 170 


302378 


AL109712 


Hs.296506 


Homo sapiens mRNA full length Insert cDN 


n„ m 47 A 0 47C 

Seq ID V4 & 175 


319267 


F11802 


Hs.6818 


ESTs 


Seq ID 176 & 177 


310442 


AW072215 


Hs.208470 


ESTs 


Seq ID 178 & 179 


300469 


BE301708 


Hs.233955 


hypothetical protein FU20401 


Seq ID 180 & 181 


331237 


W87874 


Hs.25277 


Homo sapiens cDNA FU10717 fis; clone NT 


C#.» in 4 0O P 4 00 

Seq ID 182 & 183 


330968 


R44557 


Hs.23748 


ESTs 


Seq ID 184 & 155 


301822 


X17033 


Hs.271986 


integral, alpha 2 (CD49B, alpha 2 subuni 


Seq ID 186 & 187 


422573 


AW297985 


Hs.295726 


integrin, alpha V (vitronectin receptor 


ri ir\ inn o inn 

Seq ID 188 & 189 


133061 


AI186431 


Hs.296638 


prostate differentiation factor 


0__ lr\ 4 nn 0 404 

Seq ID 190 & lyi 


135235 


AW298244 


Hs.266195 


ESTs 


Cam in 4 no 0 4QO 

Seq ID 192 & 1aJ 


101192 


BE247295 


Hs78452 


solute carrier family 20 (phosphate tran 


0__ in 4 OX 0 4QC 

beq ID 194 & iyj> 


113195 


H83265 


Hs.8881 


ESTs, Weakly similar to S41044 chromosom 


e«»» in 4QC 0 4Q7 

beq ID loo & 1?/ 


101741 


NNL003199 


Hs.326198 


transcription factor 4 


p„_ m 4O0 0 4 OA 

Seq id 198 & lyy 


321911 


AF026944 


Hs.293797 


ESTs 


c«« in onn 0 on.4 
beq ID 2UU & 2U 1 


320635 


N50617 


Hs.80506 


small nuclear ribonucleoprotein poiypept 


0 _ _ ipi ono 0 000 

Seq ID 202 & 2uo 


326230 






NM_017643:Homo sapiens hypothetical prot 


m onji 0 oac 

Seq ID 204 & 205 


132968 


AF234532 


Hs.61638 


myosin X 


tn one o OA7 

Seq ID 206 & 207 


135073 


W55956 


Hs.94030 


Homo sapiens mRNA; cDNA DKF2p586E1624 (f 


ry rr\ onn O OAn 

Seq ID 208 & 209 


108937 


AL050107 


Hs.24341 


transcriptional co-activator with PDZ-bi 


Seq ID 210 & 211 


116430 


AK001531 


Hs.66048 


hypothetical protein FU10669 


Seq ID 212 & 213 


104877 


Al 138635 


Hs.22968 


U« M « Jama UilA^C'/IC^MQ m Dk| A on 

Homo sapiens clone iMAbb.4oiyoy, mKNA se 


Opn in 91d A 91 1; 

Ocvj IU £ IH Ot £ 19 


122697 


AA420683 


Hs.98321 


hypothetical protein FU14103 


Seq ID 216 & 217 


112522 


R68857 


Hs.265499 


ESTs 


Seq ID 218 & 219 


304782 


AA582081 




gb:nn32h08.s1 NCI_CGAPJ3as1 Homo sapiens 


Seq ID 220 & 221 


312802 


AA644669 


Hs.193042 


ESTs 


Seq ID 222 & 223 


302680 


AW192334 


Hs.38218 


ESTs 


Seq ID 224 & 225 


326198 






Phase2&3Exons 


Seq ID 226 & 227 


331019 


NM.006033 


Hs.65370 


lipase; endothelial 


Seq ID 228 & 229 
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TABLE 8 



Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: NM_0014 00 

Coding sequence: 244-2208 (underlined 



sequences correspond to start and stop codons) ) 



1 11 21 31 41 51 

I I i I I I 

GTCGGGGGCA GCAGCAAGAT GCGAAGCGAG CCGTACAGAT CCCGGGCTCT CCGAACGCAA 60 

CTTCGCCCTG CTTGAGCGAG GCTGCGGTTT CCGAGGCCCT CTCCAGCCAA GGAAAAGCTA 120 

CACAAAAAGC CTGGATCACT CATCGAACCA CCCCTGAAGC CAGTGAAGGC TCTCTCGCCT 180 

CGCCCTCTAG CGTTCGTCTG GAGTAGCGCC ACCCCGGCTT CCTGGGGACA CAGGGTTGGC 240 

ACCATGGGGC CCACCAGCGT CCCGCTGGTC AAGGCCCACC GCAGCTCGGT CTCTGACTAC 300 

GTCAACTATG ATATCATCGT CCGGCATTAC AACTACACGG GAAAGCTGAA TATCAGCGCG 360 

GACAAGGAGA ACAGCATTAA ACTGACCTCG GTGGTGTTCA TTCTCATCTG CTGCTTTATC 420 

ATCCTGGAGA ACATCTTTGT CTTGCTGACC ATTTGGAAAA CCAAGAAATT CCACCGACCC 480 

ATGTACTATT TTATTGGCAA TCTGGCCCTC TCAGACCTGT TGGCAGGAGT AGCCTACACA 540 

GCTAACCTGC TCTTGTCTGG GGCCACCACC TACAAGCTCA CTCCCGCCCA GTGGTTTCTG 600 

. CGGGAAGGGA GTATGTTTGT GGCCCTGTCA GCCTCCGTGT TCAGTCTCCT CGCCATCGCC 660 

ATTGAGCGCT ATATCACAAT GCTGAAAATG AAACTCCACA ACGGGAGCAA TAACTTCCGC 720 

CTCTTCCTGC TAATCAGCGC CTGCTGGGTC ATCTCCCTCA TCCTGGGTGG CCTGCCTATC 780 

ATGGGCTGGA ACTGCATCAG TGCGCTGTCC AGCTGCTCCA CCGTGCTGCC GCTCTACCAC 840 

AAGCACTATA TCCTCTTCTG CACCACGGTC TTCACTCTGC TTCTGCTCTC CATCGTCATT 900 

CTGTACTGCA GAATCTACTC CTTGGTCAGG ACTCGGAGCC GCCGCCTGAC GTTCCGCAAG 960 

AACATTTCCA AGGCCAGCCG CAGCTCTGAG AAGTCGCTGG CGCTGCTCAA GACCGTAATT 1020 

ATCGTCCTGA GCGTCTTCAT CGCCTGCTGG GCACCGCTCT TCATCCTGCT CCTGCTGGAT 1080 

GTGGGCTGCA AGGTGAAGAC CTGTGACATC CTCTTCAGAG CGGAGTACTT CCTGGTGTTA 1140 

GCTGTGCTCA ACTCCGGCAC CAACCCCATC ATTTACACTC TGACCAACAA GGAGATGCGT 1200 

CGGGCCTTCA TCCGGATCAT GTCCTGCTGC AAGTGCCCGA GCGGAGACTC TGCTGGCAAA 1260 

TTCAAGCGAC CCATCATCGC CGGCATGGAA TTCAGCCGCA GCAAATCGGA CAATTCCTCC 1320 

CACCCCCAGA AAGACGAAGG GGACAACCCA GAGACCATTA TGTCTTCTGG AAACGTCAAC 1380 

TCTTCTTCCT AGAACTGGAA GCTGTCCACC CACCGGAAGC GCTCTTTACT TGGTCGCTGG 1440 

CCACCCCAGT GTTTGGAAAA AAATCTCTGG GCTTCGACTG CTGCCAGGGA GGAGCTGCTG 1500 

CAAGCCAGAG GGAGGAAGGG GGAGAATACG AACAGCCTGG TGGTGTCGGG TGTTGGTGGG 1560 

TAGAGTTAGT TCCTGTGAAC AATGCACTGG GAAGGGTGGA GATCAGGTCC CGGCCTGGAA 1620 

TATATATTCT ACCCCCCTGG AGCTTTGATT TTGCACTGAG CCAAAGGTCT AGCATTGTCA 1680 

AGCTCCTAAA GGGTTCATTT GGCCCCTCCT CAAAGACTAA TGTCCCCATG TGAAAGCGTC 1740 

TCTTTGTCTG GAGCTTTGAG GAGATGTTTT CCTTCACTTT AGTTTCAAAC CCAAGTGAGT 1800 

GTGTGCACTT CTGCTTCTTT AGGGATGCCC TGTACATCCC ACACCCCACC CTCCCTTCCC I860 

TTCATACCCC TCCTCAACGT TCTTTTACTT TATACTTTAA CTACCTGAGA GTTATCAGAG 1920 

CTGGGGTTGT GGAATGATCG ATCATCTATA GCAAATAGGC TATGTTGAGT ACGTAGGCTG 1980 

TGGGAAGATG AAGATGGTTT GGAGGTGTAA AACAATGTCC TTCGCTGAGG CCAAAGTTTC 2040 

CATGTAAGCG GGATCCGTTT TTTGGAATTT GGTTGAAGTC ACTTTGATTT CTTTAAAAAA 2100 

CATCTTTTCA ATGAAATGTG TTACCATTTC ATATCCATTG AAGCCGAAAT CTGCATAAGG 2160 

AAGCCCACTT TATCTAAATG ATATTAGCCA GGATCCTTGG TGTCCTAGGA GAAACAGACA 2220 

AGCAAAACAA AGTGAAAACC GAATGGATTA ACTTTTGCAA ACCAAGGGAG ATTTCTTAGC 2280 

AAATGAGTCT AACAAATATG ACATCCGTCT TTCCCACTTT TGTTGATGTT TATTTCAGAA 2340 

TCTTGTGTGA TTCATTTCAA GCAACAACAT GTTGTATTTT GTTGTGTTAA AAGTACTTTT 2400 

CTTGATTTTT GAATGTATTT GTTTCAGGAA GAAGTCATTT TATGGATTTT ' TCTAACCCGT 2460 

GTTAACTTTT CTAGAATCCA CCCTCTTGTG CCCTTAAGCA TTACTTTAAC TGGTAGGGAA 2520 

CGCCAGAACT TTTAAGTCCA GCTATTCATT AGATAGTAAT TGAAGATATG TATAAATATT 2580 

ACAAAGAATA AAAATATATT ACTGTCTCTT TAGTATGGTT TTCAGTGCAA TTAAACCGAG 2640 

AGATGTCTTG TTTTTTTAAA AAGAATAGTA TTTAATAGGT TTCTGACTTT TGTGGATCAT 2700 
TTTGCACATA GCTTTATCAA CTTTTAAACA TTAATAAACT GATTTTTTTA AAG 



Seq ID NO: 2 Protein sequence: 
protein Accession #: NP 001391 



1 11 21 31 41 51 

1 I I I I I 

MGPTSVPLVK AHRSSVSDYV NYDIIVRHYN YTGKLNISAD KENSIKLTSV VFILICCFII 60 

LENIFVLLTI WKTKKFHRPM YYFIGNLALS DLLAGVAYTA NLLLSGATTY KLTPAQWFLR 120 

EGSMFVALSA SVFSLLAIAI ERYITMLKMK LHNGSNNFRL FLLISACWVI SLILGGLPIM 180 

GWNCISALSS CSTVLPLYHK HYILFCTTVF TCjLLLSIVIL YCRIYSLVRT RSRRLTFRKN 240 

ISKASRSSEK SLALLKTVII VLSVFIACWA PIjFILLLLDV GCKVKTCDIL FRAEYFLVIiA 300 

VLNSGTNPII YTLTNKEMRR AFIRIMSCCK CPSGDSAGKF KRPIIAGMEF SRSKSDNSSH 360 
PQKDEGDNPE TIMSSGNVNS SS 
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Seq ID NO: 3 Nucleotide sequencer 
Nucleic Acid Accession #: NM_016242 

Coding sequence: 79-864 (underlined sequences correspond to start and stop codons) ) 



1 11 21 31 41 51 

I I I I I I 

AAGGCCCTGC CAGCTTGGGA GGGAATTGTC CCTGCCTGCT TCTGGAGAAA GAAGATATTG 60 

ACACCATCTA CGGGCACCAT GGAACTGCTT CAAGTGACCA TTCTTTTTCT TCTGCCCAGT 120 

ATTTGCAGCA GTAACAGCAC AGGTGTTTTA GAGGCAGCTA ATAATTCACT TGTTGTTACT 180 

ACAACAAAAC CATCTATAAC AACACCAAAC ACAGAATCAT TACAGAAAAA TGTTGTCACA 240 

CCAACAACTG GAACAACTCC TAAAGGAACA ATCACCAATG AATTACTTAA AATGTCTCTG 300 

ATGTCAACAG CTACTTTTTT AACAAGTAAA GATGAAGGAT TGAAAGCCAC AACCACTGAT 360 

GTCAGGAAGA ATGACTCCAT CATTTCAAAC GTAACAGTAA CAAGTGTTAC ACTTCCCAAT 420 

GCTGTTTCAA CATTACAAAG TTCCAAACCC AAGACTGAAA CTCAGAGTTC AATTAAAACA 480 

ACAGAAATAC CAGGTAGTGT TCTACAACCA GATGCATCAC CTTCTAAAAC TGGTACATTA 540 

ACCTCAATAC CAGTTACAAT TCCAGAAAAC ACCTCACAGT CTCAAGTAAT AGACACTGAG 600 

GGTGGAAAAA ATGCAAGCAC TTCAGCAACC AGCCGGTCTT ATTCCAGTAT TATTTTGCCG 660 

GTGGTTATTG CTTTGATTGT AATAACACTT TCAGTATTTG TTCTGGTGGG TTTGTACCGA 720 

ATGTGCTGGA AGGCAGATCC GGGCACACCA GAAAATGGAA ATGATCAACC TCAGTCTGAT 780 

AAAGAGAGCG TGAAGCTTCT TACCGTTAAG ACAATTTCTC ATGAGTCTGG TGAGCACTCT 840 

GCACAAGGAA AAACCAAGAA CTGACAGCTT GAGGAATTCT CTCCACACCT AGGCAATAAT 900 

TACGCTTAAT CTTCAGCTTC TATGCACCAA GCGTGGAAAA GGAGAAAGTC CTGCAGAATC 960 
AATCCCGACT TCCATACCTG CTGCTGG 



Seq ID NO: 4 Protein sequence: 
Protein Accession #: NP_057326 

1 11 21 31 41 51 

I I - I I I I 

MELLQVTILF LLPS ICS SNS TGVLEAANNS LWTTTKPSI TTPNTESLQK NWTPTTGTT 

PKGTITNELL KMSLMSTATF LTSKDEGLKA TTTDVRKNDS IISNVTVTSV TLPNAVSTLQ 

SSKPKTETQS SIKTTEIPGS VLQPDASPSK TGTLTSIPVT IPENTSQSQV IDTEGGKNAS 

TSATSRSYSS IILPWIALI VITLSVFVLV GLYRMCWKAD PGTPENGNDQ PQSDKESVKL 
LTVKTISHES GEHSAQGKTK N 



Seq ID NO: 5 Nucleotide sequence: 
Nucleic Acid Accession #: NM_002205 , 

Coding sequence: 24.. 3173 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CAGGACAGGG AAGAGCGGGC GCTATGGGGA GCCGGACGCC AGAGTCCCCT CTCCACGCCG 60 

TGCAGCTGCG CTGGGGCCCC CGGCGCCGAC CCCCGCTCGT GCCGCTGCTG TTGCTGCTCG 120 

TGCCGCCGCC ACCCAGGGTC GGGGGCTTCA ACTTAGACGC GGAGGCCCCA GCAGTACTCT 180 

CGGGGCCCCC GGGCTCCTTC TTCGGATTCT CAGTGGAGTT TTACCGGCCG GGAACAGACG 240 

GGGTCAGTGT GCTGGTGGGA GCACCCAAGG CTAATACCAG CCAGCCAGGA GTGCTGCAGG 300 

GTGGTGCTGT CTACCTCTGT CCTTGGGGTG CCAGCCCCAC ACAGTGCACC CCCATTGAAT 360 

TTGACAGCAA AGGCTCTCGG CTCCTGGAGT CCTCACTGTC CAGCTCAGAG GGAGAGGAGC 420 

CTGTGGAGTA CAAGTCCTTG CAGTGGTTCG GGGCAACAGT TCGAGCCCAT GGCTCCTCCA 480 

TCTTGGCATG CGCTCCACTG TACAGCTGGC GCACAGAGAA GGAGCCACTG AGCGACCCCG 540 

TGGGCACCTG CTACCTCTCC ACAGATAACT TCACCCGAAT TCTGGAGTAT GCACCCTGCC 600 

GCTCAGATTT CAGCTGGGCA GCAGGACAGG GTTACTGCCA AGGAGGCTTC AGTGCCGAGT 660 

TCACCAAGAC TGGCCGTGTG GTTTTAGGTG GACCAGGAAG CTATTTCTGG CAAGGCCAGA 720 

TCCTGTCTGC CACTCAGGAG CAGATTGCAG AATCTTATTA CCCCGAGTAC CTGATCAACC 780 

TGGTTCAGGG GCAGCTGCAG ACTCGCCAGG CCAGTTCCAT CTATGATGAC AGCTACCTAG 840 

GATACTCTGT GGCTGTTGGT GAATTCAGTG GTGATGACAC AGAAGACTTT GTTGCTGGTG 900 

TGCCCAAAGG GAACCTCACT TACGGCTATG TCACCATCCT TAATGGCTCA GACATTCGAT 960 

CCCTCTACAA CTTCTCAGGG GAACAGATGG CCTCCTACTT TGGCTATGCA GTGGCCGCCA 1020 

CAGACGTCAA TGGGGACGGG CTGGATGACT TGCTGGTGGG GGCACCCCTG CTCATGGATC 1080 

GGACCCCTGA CGGGCGGCCT CAGGAGGTGG GCAGGGTCTA CGTCTACCTG CAGCACCCAG 1140 

CCGGCATAGA GCCCACGCCC ACCCTTACCC TCACTGGCCA TGATGAGTTT GGCCGATTTG 1200 

GCAGCTCCTT GACCCCCCTG GGGGACCTGG ACCAGGATGG CTACAATGAT GTGGCCATCG 1260 

GGGCTCCCTT TGGTGGGGAG ACCCAGCAGG GAGTAGTGTT TGTATTTCCT GGGGGCCCAG 1320 
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GAGGGCTGGG CTCTAAGCCT TCCCAGGTTC TGCAGCCCCT GTGGGCAGCC AGCCACACCC 1380 

CAGACTTCTT TGGCTCTGCC CTTCGAGGAG GCCGAGACCT GGATGGCAAT GGATATCCTG 1440 

ATCTGATTGT GGGGTCCTTT GGTGTGGACA AGGCTGTGGT ATACAGGGGC CGCCCCATCG 1500 

TGTCCGCTAG TGCCTCCCTC ACCATCTTCC CCGCCATGTT CAACCCAGAG GAGCGGAGCT 1560 

5 GCAGCTTAGA GGGGAACCCT GTGGCCTGCA TCAACCTTAG CTTCTGCCTC AATGCTTCTG 1620* 

GAAAACACGT TGCTGACTCC ATTGGTTTCA CAGTGGAACT TCAGCTGGAC TGGCAGAAGC 1680 

AGAAGGGAGG GGTACGGCGG GCACTGTTCC TGGCCTCCAG GCAGGCAACC CTGACCCAGA 1740 

CCCTGCTCAT CCAGAATGGG GCTCGAGAGG ATTGCAGAGA GATGAAGATC TACCTCAGGA 1800 

ACGAGTCAGA ATTTCGAGAC AAACTCTCGC CGATTCACAT CGCTCTCAAC TTCTCCTTGG I860 

10 ACCCCCAAGC CCCAGTGGAC AGCCACGGCC TCAGGCCAGC CCTACATTAT CAGAGCAAGA 1920 

GCCGGATAGA GGACAAGGCT CAGATCTTGC TGGACTGTGG AGAAGACAAC ATCTGTGTGC 1980 

CTGACCTGCA GCTGGAAGTG TTTGGGGAGC AGAACCATGT GTACCTGGGT GACAAGAATG 2040 

CCCTGAACCT CACTTTCCAT GCCCAGAATG TGGGTGAGGG TGGCGCCTAT GAGGCTGAGC 2100 

TTCGGGTCAC CGCCCCTCCA GAGGCTGAGT ACTCAGGACT CGTCAGACAC CCAGGGAACT 2160 

15 TCTCCAGCCT GAGCTGTGAC TACTTTGCCG TGAACCAGAG CCGCCTGCTG GTGTGTGACC 2220 

TGGGCAACCC CATGAAGGCA GGAGCCAGTC TGTGGGGTGG CCTTCGGTTT ACAGTCCCTC 2280 

ATCTCCGGGA CACTAAGAAA ACCATCCAGT TTGACTTCCA GATCCTCAGC AAGAATCTCA 2340 

ACAACTCGCA AAGCGACGTG GTTTCCTTTC GGCTCTCCGT GGAGGCTCAG GCCCAGGTCA 2400 

CCCTGAACGG TGTCTCCAAG CCTGAGGCAG TGCTATTCCC AGTAAGCGAC TGGCATCCCC 2460 

20 GAGACCAGCC TCAGAAGGAG GAGGACCTGG GACCTGCTGT CCACCATGTC TATGAGCTCA 2520 

TCAACCAAGG CCCCAGCTCC ATTAGCCAGG GTGTGCTGGA ACTCAGCTGT CCCCAGGCTC 2580 

TGGAAGGTCA GCAGCTCCTA TATGTGACCA GAGTTACGGG ACTCAACTGC ACCACCAATC 2640 

ACCCCATTAA CCCAAAGGGC CTGGAGTTGG ATCCCGAGGG TTCCCTGCAC CACCAGCAAA 2700 

AACGGGAAGC TCCAAGCCGC AGCTCTGCTT CCTCGGGACC TCAGATCCTG AAATGCCCGG 2760 

25 AGGCTGAGTG TTTCAGGCTG CGCTGTGAGC TCGGGCCCCT GCACCAACAA GAGAGCCAAA 2820 

GTCTGCAGTT GCATTTCCGA GTCTGGGCCA AGACTTTCTT GCAGCGGGAG CACCAGCCAT 2880 

TTAGCCTGCA GTGTGAGGCT GTGTACAAAG CCCTGAAGAT GCCCTACCGA ATCCTGCCTC 2940 

GGCAGCTGCC CCAAAAAGAG CGTCAGGTGG CCACAGCTGT GCAATGGACC AAGGCAGAAG 3000 

GCAGCTATGG CGTCCCACTG TGGATCATCA TCCTAGCCAT CCTGTTTGGC CTCCTGCTCC 3060 

30 TAGGTCTACT CATCTACATC CTCTACAAGC TTGGATTCTT CAAACGCTCC CTCCCATATG 3120 

GCACCGCCAT GGAAAAAGCT CAGCTCAAGC CTCCAGCCAC CTCTGATGCC TGAGTCCTCC 3180 

CAATTTCAGA CTCCCATTCC TGAAGAACCA GTCCCCCCAC CCTCATTCTA CTGAAAAGGA 3240 

GGGGTCTGGG TACTTCTTGA AGGTGCTGAC GGCCAGGGAG AAGCTCCTCT CCCCAGCCCA 3300 

GAGACATACT TGAAGGGCCA GAGCCAGGGG GGTGAGGAGC TGGGGATCCC TCCCCCCCAT 3360 

35 GCACTGTGAA GGACCCTTGT TTACACATAC CCTCTTCATG GATGGGGGAA CTCAGATCCA 3420 

GGGACAGAGG CCCAGCCTCC CTGAAGCCTT TGCATTTTGG AGAGTTTCCT GAAACAACTG 3480 

GAAAGATAAC TAGGAAATCC ATTCACAGTT CTTTGGGCCA GACATGCCAC AAGGACTTCC 3540 

TGTCCAGCTC CAACCTGCAA AGATCTGTCC TCAGCCTTGC CAGAGATCCA AAAGAAGCCC 3600 

CCAGTAAGAA CCTGGAACTT GGGGAGTTAA GACCTGGCAG CTCTGGACAG CCCCACCCTG 3660 

40 GTGGGCCAAC AAAGAACACT AACTATGCAT GGTGCCCCAG GACCAGCTCA GGACAGATGC 3720 

CACAAGGATA GATGCTGGCC CAGGGCCAGA GCCCAGCTCC AAGGGGAATC AGAACTCAAA 3780 

TGGGGCCAGA TCCAGCCTGG GGTCTGGAGT TGATCTGGAA CCCAGACTCA GACATTGGCA 3840 

CCAATCCAGG CAGATCCAGG ACTATATTTG GGCCTGCTCC AGACCTGATC CTGGAGGCCC 3900 

AGTTCACCCT GATTTAGGAG AAGCCAGGAA TTTCCCAGGA CCTGAAGGGG CCATGATGGC 3960 

45 AACAGATCTG GAACCTCAGC CTGGCCAGAC ACAGGCCCTC CCTGTTCCCC AGAGAAAGGG 4020 

GAGCCCACTG TCCTGGGCCT GCAGAATTTG GGTTCTGCCT GCCAGCTGCA CTGATGCTGC 4080 

CCCTCATCTC TCTGCCCAAC CCTTCCCTCA CCTTGGCACC AGACACCCAG GACTTATTTA 4140 

AACTCTGTTG CAAGTGCAAT AAATCTGACC CAGTGCCCCC ACTGACCAGA ACTAGAAAAA 4200 
AAAA 



50 



Seq ID NO: 6 Protein sequence: 
Protein Accession #: NP_002196.1 



55 1 11 21 31. 41 51 

I I I I I I 

MGSRTPESPL HAVQLRWGPR RRPPLVPLLL LLVPPPPRVG GFNLDAEAPA VLSGPPGSFP 60 

GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

IjESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYIiST 180 

60 DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRW LGGPGSYFWQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 

65 VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540 

GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

liSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

70 SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 

SQGVLELSCP QALEGQQIiLY VTRVTGLNCT TNHPINPKGL ELD PEGS LHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 

75 
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Seq ID NO: 7 Nucleotide sequence: 
Nucleic Acid Accession #: NM_002211 

Coding sequence: 104 .. 2500 ~< underlined sequences correspond to start and stop codons) 



GTCCGCCAAA 
ACAAGCCTAA 
TTTCTGGATT 
ATGTTTAAAA 
GTGGTGCACA 
TTTAGAAGCC 
CAAAGATATA 
CAAGCCAGAG 
GGAGCCACAG 
CTACCTTATG 
AACAGATCTG 
ATTTGTGGAA 
TTGCACAAGT 
TAATAAAGGA 
TTCTCCAGAA 
CTGGAGGAAT 
AGATGGGAAA 
TATGTACACA 
GAGTGAAAAT 
GGAGCTGAAA 
TGTAATTCAG 
CGGCAAATTG 
TGGAACAGGG 
TGAAATTAGC 
GCCTCTGGGC 
CCAAAGCGAA 
TGGCGCGTGC 
AGTTAACAGT 
TAACAATGGA 
AATTTATTCT 
AATTTGTGGA 
TGGCAGTGCA 
CTGCAATGGC 
AGGGCAAACG 
TGTTCAGTGC 
CTATTTTAAC 
TCCTGTGTCC 
AGTGAATGGG 
TCCAGACATC 
ATTACTGCTG 
TGAAAAGGAG 
CGTAACAACT 
ACAACACTGA 
' TTGCCATGGT 
TGTTTTATTA 
ACAGGATGGT 
GACTATTGAA 
AGTTAAAGTA 
CTTTACATAT 
AAGGATTGTT 
CTGAAAGACA 
AAAGGCCATG 
GTGCCATTTT 
AGCCTTAGAT 
ACATACAGGC 
TTTATTATTT 
TGTATTTGTT 
AGCGGTCAAT 
TCACCTACGC 
CACCATTTGA 
GTTTTAACAG 



11 
I 

ACCTGCGCGG 
CGTCCGCTGG 
GGACTGATCA 
GCAAATGCCA 
AATTCAACAT 
TTAAAAAAGA 
AAGAAAAATA 
GATATTACTC 
ACATTTACAT 
GACCTGTCTT 
ATGAATGAAA 
AAGACTGTGA 
GAACAGAACT 
GAAGTATTTA 
GGTGGTTTCG 
GTTACACGGC 
CTTGGTGGCA 
ATGAGCCATT 
AATATTCAGA 
AACTTGATCC 
TTGATCATTG 
TCAGAAGGAG 
GAAAATGGAA 
ATAACTTCAA 
TTTACGGAGG 
GGCATCCCTG 
AGGTGCAATG 
GAAGACATGG 
GAGTGCGTCT 
GGCAAATTCT 
GGAAATGGTG 
TGTGACTGTT 
CGGGGCATCT 
TGTGAGATGT 
AGAGCCTTCA 
ATTACCAAGG 
CATTGTAAGG 
AACAACGAGG 
ATTCCAATTG 
ATATGGAAGC 
AAAATGAATG 
GTGGTCAATC 
ATGCAAAGTA 
TTTACTCATG 
TTTTGAAAAT 
TATTCTTGTC 
ATCAAGCTTA 
ATGAGCATGA 
GTCAGTTTGC 
TTAAATCTGT 
AGTATGTTGA 
GGAAAAATTC 
AAGAGTTACT 
AAAAGAACCG 
CATACTTTAC 
TTATTTTGTT 
TGCAATTTTG 
TTGCCTTTTT 
GAGTCTTACT 
GTTGCCCATC 
TTCA 



21 
I 

ATAGGGAAGA 
GCCCCGGACG 
GTTCAGTTTG 
AATCATGTGG 
TTTTACAGGA 
AGGGTTGCCC 
AAAATGTAAC 
AGATCCAACC 
TAAAATTCAA 
ATTCAATGAA 
TGAGGAGGAT 
TGCCTTACAT 
GCACCACCCC 
ATGAACTTGT 
ATGCCATCAT 
TGCTGGTGTT 
TTGTTTTACC 
ATTATGATTA 
CAATTTTTGC 
CTAAGTCAGC 
ATGCATACAA 
TAACAATAAG 
GAAAATGTTC 
ATAAGTGTCC 
AAGTAGAGGT 
AAAGTCCCAA 
AAGGGCGTGT 
ATGCTTACTG 
GCGGACAGTG 
GCGAGTGTGA 
TTTGCAAGTG 
CTTTGGATAC 
GCGAGTGTGG 
GTCAGACCTG 
ATAAAGGAGA 
TAGAAAGTCG 
AGAAGGATGT 
TCATGGTTCA 
TAGCTGGTGT 
TTTTAATGAT 
CCAAATGGGA 
CGAAGTATGA 
GCAATTTCCA 
TGCAGGTTTT 
AATGTTGTAA 
AGCTAAGGTC 
TTGGATTAAG 
TGAGAGTTTC 
AGTTATGCAG 
TATTTTGCTA 
GAGTTGCTGG 
AGAGAGTTAG 
TAATGTTTGG 
AGCAATTTTC 
AAAGTATTTG 
TAATGTCTGG 
GGGTAAGACT 
AATGAACATG 
TTGAGTTAGT 
TTGTTTCACA 



31 
I 

ACAGCACCCC 
CCGCGCGGAA 
CTGTGTGTTT 
AGAATGTATA 
AGGAATGCCT 
TCCAGATGAC 
CAACCGTAGC 
ACAGCAGTTG 
GAGAGCTGAA 
AGACGATTTG 
TACTTCGGAC 
TAGCACAACA 
ATTTAGCTAC 
TGGAAAACAG 
GCAAGTTGCA 
TTCCACAGAT 
AAATGATGGA 
TCCTTCTATT 
AGTTACTGAA 
AGTAGGAACA 
TTCCCTTTCC 
TTACAAATCT 
CAATATTTCC 
AAAAAAGGAT 
TATTCTTCAG 
GTGTCATGAA 
TGGTAGACAT 
CAGGAAAGAA 
TGTTTGTAGG 
TAATTTCAAC 
TCGTGTGTGT 
TAGTACTTGT 
TGTCTGTAAG 
CCTTGGTGTC 
AAAGAAAGAC 
GGACAAATTA 
TGACGACTGT 
TGTTGTGGAG 
GGTTGCTGGA 
AATTCATGAC 
CACGGGTGAA 
GGGAAAATGA 
TAGTCACAGT 
GAAAATGTAC 
TTCATGCCAG 
ACATTGTGCC 
TGATATTTCT 
TGTTAATCAT 
AATCCAAAGT 
TTTGCCTGTT 
TGTAAAATAC 
GAAGGAAAAA 
TAACTTTTAT- 
TGCTAAAAAG 
CTGAATGGGG 
TGCTTTCTAT 
TTTTTATGAG 
TGAAGTTATA 
GCCATAACAG 
CTAGTCACAT 



41 
I 

GGCGCCGATT 
AAGATGAATT 
GCTCAAACAG 
CAAGCAGGGC 
ACTTCTGCAC 
ATAGAAAATC 
AAAGGAACAG 
GTTTTGCGAT 
GACTATCCCA 
GAGAATGTAA 
TTCAGAATTG 
CCAGCTAAGC 
AAAAATGTGC 
CGCATATCTG 
GTTTGTGGAT 
GCCGGGTTTC 
CAATGTCACC 
GCTCACCTTG 
GAATTTCAGC 
TTATCTGCAA 
TCAGAAGTCA 
TACTGCAAGA 
ATTGGAGATG 
TCTGACAGCT 
TACATCTGTG 
GGAAATGGGA 
TGTGAATGCA 
AACAGTTCAG 
AAGAGGGATA 
TGTGATAGAT 
GAGTGCAACC 
GAAGCCAGCA 
TGTACAGATC 
TGTGCTGAGC 
ACATGCACAC 
CCCCAGCCGG 
TGGTTCTATT 
AATCCAGAGT 
ATTGTTCTTA 
AGAAGGGAGT 
AATCCTATTT 
GTACTGCCCG 
TAGGTAGCTT 
AATATGTATA 
GGACTGACAA 
TTTTTGACCT 
ATAGCGATTG 
GTATTAAAAC 
AAATGTCCTG 
AGACATGACT 
GTTTGAAATA 
CCAATAGCTT 
GCCTTCACTT 
TCCTTGATTT 
ACCTTTTGAG 
CACCTCTTCT 
TACTTTTTCT 
CTGTGGCTAT 
ACCACTGTAT 
TCTTGTTTTA 



51 
I 

GCCGTACCAA 

TACAACCAAT 

ATGAAAATAG 

CAAATTGTGG 

GATGTGATGA 

CCAGAGGCTC 

CAGAGAAGCT 

TAAGATCAGG 

TTGACCTCTA 

AAAGTCTTGG 

GATTTGGCTC 

TCAGGAACCC 

TCAGTCTTAC 

GAAATTTGGA 

CACTGATTGG 

ACTTTGCTGG 

TGGAAAATAA 

TCCAGAAACT 

CTGTTTACAA 

ATTCTAGCAA 

TTTTGGAAAA 

ACGGGGTGAA 

AGGTTCAATT 

TTAAAATTAG 

AATGTGAATG 

CATTTGAGTG 

GCACAGATGA 

AAATCTGCAG 

ATAGAAATGA 

CCAATGGCTT 

CCAACTACAC 

ACGGACAGAT 

CGAAGTTTCA 

ATAAAGAATG 

AGGAATGTTC 

TCCAACCTGA 

TTACGTATTC 

GTCCCACTGG 

TTGGCCTTGC 

TTGCTAAATT 

ATAAGAGTGC 

TGCAAATCCC 

TAGGGCAATA^ 

ATTTTTAAAA 

AAGACTTGAG 

TTTCTTCCTG 

AAAGGGCAAT 

TGATTTTTAG 

CTAGCTAGTT 

GATGACATAT 

GTTGATCTAC 

TAAAACCTGT 

TACAAATTCA 

AGCACTATTT 

TTGAATTTAT 

AATCTTTTAA 

TTGAAGTTTT 

GCAACAGCTC 

GTTTACTTCT 

AGTGCCTTTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3460 
3540 
3600 



Seq ID NO: 8 Protein sequence: 
Protein Accession #: NP_0O2202 



1 11 21 31 41 51 

I I 1 I I t 

MNLQPIFWIG LISSVCCVFA QTDENRCIiKA NAKSCGECIQ AGPNCGWCTN STFLQEGMPT 



60 



182 



WO 02/079492 



PCT7US02/04915 



SARCDDLEAL 
LRLRSGEPQT 
RIGFGSFVEK 
ISGNLDSPEG 
CHLENNMYTM 
SANSSNVIQL 
GDEVQFEISI 
NGTFECGACR 
RDNTNEIYSG 
ASNGQICNGR 
CTQECSYFNI 
PECPTGPDII 
PIYKSAVTTV 



KKKGCPPDDI 
FTLKFKRAED 
TVMPYISTTP 
GFDAIMQVAV 
SHYYDYPSIA 
IIDAYNSLSS 
TSNKCPKKDS 
CNEGRVGRHC 
KFCECDNFNC 
GICECGVCKC 
TKVESRDKLP 
PIVAGWAGI 
VNPKYEGK 



ENPRGSKDIK 
YPIDLYYLMD 
AKLRNPCTSE 
CGSLIGWRNV 
HLVQKLSENN 
EVILENGKLS 
DSFKIRPLGF 
ECSTDEVNSE 
DRSNGLICGG 
TDPKFQGQTC 
QPVQPDPVSH 
VLIGLALLLI 



KNKNVTNRSK 
ttSYSMKDDLE 
QNCTSPFSYK 
TRLLVFSTDA 
IQTIFAVTEE 
EGVTISYKSY 
TEBVEVILQY 
DMDAYCRKEN 
NGVCKCRVCE 
EMCQTCLGVC 
CKEKDVDDCW 
WKUjMIIHDR 



GTAEKLKPED 
NVKSLGTDLM 
NVLSLTNKGB 
GFHFAGDGKL 
FQPVYKELKN 
CKNGVNGTGE 
ICECECQSEG 
SSEICSNNGE 
CNPNYTGSAC 
AEHKECVQCR 
FYFTYSVNGN 
REFAKFEKEK 



ITQIQPQQLV 
NEMRRITSDF 
VFNELVGKQR 
GGIVLPNDGQ 
LIPKSAVGTL 
NGRKCSNISI 
IPESPKCHEG 
CVCGQCVCRK 
DCSLDTSTCE 
AFNKGEKKDT 
NEVMVHWEN 
MNAKWDTGEN 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Seq ID NO: 9 Nucleotide sequence: ' 
Nucleic Acid Accession #:NM_002425 

Coding sequence: 23 ..1453 (underlined sequences correspond to start and stop codons) 



1 
I 

AAAGAAGGTA 
AGTCTGCTCT 
TGCCCAGCAA 
AAAGGACAGT 
GGTGACAGGG 
TCCTGACGTT 
TACATACAGG 
TGAGAAAGCT 
AGGAGAGGCT 
TGATGGCCCA 
TATTCACTTT 
CGTTGCTGCT 
TTTGATGTAC 
TGATGTGAAT 
GGTGCCCACA 
GTCCTTCGAT 
TTGGCGAAGA 
CTCTCTTCCA 
TTTTAAAGGA 
AGGCATCCAT 
CAAGGAAAAG 
TAGCCAGTCC 
GCCTAAGGTT 
ACAGTTTGAG 
GTTACATTGC 
ATTATTCATC 
GAAGAAGATG 
ACTTGCTTTT 
' ATGTATTTTC 
CTT 



11 

I 

AGGGCAGTGA 
GCCTATCCTC 
TACCTAGAAA 
AATCTCATTG 
AAGCTAGACA 
GGTCACTTCA 
ATTGTGAATT 
CTGAAAGTCT 
GATATAATGA 
GGACACAGTT 
GATGATGATG 
CATGAACTTG 
CCACTCTACA 
GGCATTCAGT 
AAATCTGTTC 
GCCATCAGCA 
TCCCACTGGA 
TCATATTTGG 
AATGAGTTCT 
ACCCTGGGTT 
AAGAAAACAT 
ATGGAGCAAG 
GATGCTGTAT 
TTTGACCCCA 
TAGGCGAGAT 
TAATGTATTA 
AGCCTTGCAG 
GAATTGCACT 
ATAGATGTGT 



21 

I 

GAATGATGCA 
TGAGTGGGGC 
AGTACTACAA 
TTAAAAAAAT 
CTGACACTCT 
GCTCCTTTCC 
ATACACCAGA 
GGGAAGAGGT 
TCTCTTTCGC 
TGGCTCATGC 
AAAAATGGAC 
GCCACTCCCT 
ACTCATTCAC 
CTCTCTACGG 
CTTCGGGATC 
CTCTGAGGGG 
ACCCTGAACC 
ATGCTGCATA 
GGGCCATCAG 
TTCCTCCAAC 
ACTTCTTTGC 
GCTTCCCTAG 
TACAGGCATT 
ATGCCAGGAT 
AGGGGGAAGA 
TGAGCCAAAA 
ATATCTGCAT 
GAACAGAATT 
TATTACTTCC 



31 

I 

TCTTGCATTC 
AGCAAAAGAG 
CCTCGAAAAG 
CCAAGGAATG 
GGAGGTGATG 
TGGCATGCCG 
TTTGCCAAGA 
GACTCCACTC 
AGTTAAAGAA 
CTACCCACCT 
AGAAGATGCA 
GGGGCTCTTT 
AGAGCTCGCC 
ACCTCCCCCT 
TGAGATGCCA 
AGAATATCTG 
TGAATTTCAT 
TGAAGTTAAC 
AGGAAATGAG 
CATAAGGAAA 
AGCGGACAAA 
ACTAATAGCT 
TGGATTTTTC 
GGTGACACAC 
CAGATATGGG 
TGGTTAATTT 
GTGTCATGAA 
AAGAAATACT 
TCAATAAAAA 



41 
I 

CTTGTGCTGT 
GAGGACTCCA 
GATGTGAAAC 
CAGAAGTTCC 
CGCAAGCCCA 
AAGTGGAGGA 
GATGCTGTTG 
ACATTCTCCA 
CATGGAGACT 
GGACCTGGGC 
TCAGGCACCA 
CACTCAGCCA 
CAGTTCCGCC 
GCCTCTACTG 
GCCAAGTGTG 
TTCTTTAAAG 
TTGATTTCTG 
AGCAGGGACA 
GTACAAGCAG 
ATTGATGCAG 
TACTGGAGAT 
GATGACTTTC 
TACTTCTTCA 
ATATTAAAGA 
TGTTTTTAAT 
TTCCTGCATG 
GAATGTTTCT 
CATGTGCAAT 
GTTTTATTTT 



51 
I 

TGTGTCTGCC 
ACAAGGATCT 
AGTTTAGAAG 
TTGGGTTGGA 
GGTGTGGAGT 
AAACCCACCT 
ATTCTGCCAT 
GGCTGTATGA 
TTTACTCTTT 
TTTATGGAGA 
ATTTATTCCT 
ACACTGAAGC 
TTTCGCAAGA 
AGGAACCCCT 
ATCCTGCTTT 
ACAGATATTT 
CATTTTGGCC 
CCGTTTTTAT 
GTTATCCAAG 
CTGTTTCTGA 
TTGATGAAAA 
CAGGAGTTGA 
GTGGATCATC 
GTAACAGCTG 
AAATCTAATA 
TTCTGTGACT 
GGAATTCTTC 
AGGTGAGAGA 
GGGCCTGTTC 



60 
120 
180 
240 
300 
360 
42.0 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 10 Protein sequence: 
Protein Accession #: NP_002416 



MMHLAFTjVU* 
KKIOGMQKFL 
TPDLPRDAVD 
AHAYPPGPGL 
SFTELAQFRL 
LRGEYLFFKD 
AIRGNEVQAG 
FPRLIADDFP 



11 

1 

CLPVCSAYPL 
GLEVTGKLDT 
SAIEKALKVW 
YGDIHFDDDE 
SQDDVNGIQS 
RYFWRRSHWN 
YPRGIHTLGF 
GVEPKVDAVL 



21 
I 

SGAAKEEDSN 
DTLEVMRKPR 
EEVTPLTFSR 
KWTEDASGTN 
LYGPPPASTE 
PEPEFHLISA 
PPTIRKIDAA 
QAFGFFYFFS 



31 



I 

KDLAQQYLEK 
CGVPDVGHFS 
LYEGEADIMI 
LFLVAAHELG 
EPLVPTKSVP 
EWPSLPSYLD 
VSDKEKKKTY 
GSSQFEFDPN 



41 

1 

YYNLEKDVKQ 
SFPGMPKWRK 
SFAVKEHGDF 
HSLGLFHSAN 
SGSEMPAKCD 
AAYEVNSRDT 
FFAADKYWRF 
ARMVTHILKS 



51 
I 

FRRKDSNLIV 
TKLTYRIVNY 
YSFDGPGHSL 
TEALMYPLYN 
PALSFDAIST 
VFIFKGNEFW 
DENSQSMEQG 
NSWLHC 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 11 Nucleotide sequence: 
Nucleic Acid Accession #: XMJD58189 

Coding sequence: 169.. 774 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GAAGACCAGC TCAGCTCTTC AGTTGTTGAT CATTGTCTAT TGTTCTCCAA ACAGTAAACC 60 



183 



WO 02/079492 



PCT/US02/04915 



AGTATTTCAC ACTGAGATTG TCGGCTGCGG GTATATTCCA ATTCCCCGTC TCCTCATGAA 120 

TATGAAGTGA AGGGCTCTGA CCCTGGAAGT GGTTCTAAGC AGGGCAA AAT GG GGTCTCGG 180 

AAGTGTGGAG GCTGCCTAAG TTGTTTGCTG ATTCCGCTTG CACTTTGGAG TATAATCGTG 240 

AACATATTAT TGTATTTCCC GAATGGGCAA ACTTCCTATG CATCCAGCAA TAAACTCACC 300 

AACTACGTGT GGTATTTTGA AGGAATCTGT TTCTCAGGCA TCATGATGCT TATAGTAACA 360 

ACAGTTCTTC TGGTACTGGA GAATAATAAC AACTATAAAT GTTGCCAGAG TGAAAACTGC 420 

AGCAAAAAAT ATGTGACACT GCTGTCAATT ATCTTTTCTT CCCTCGGAAT TGCTTTTTCT 480 

GGATACTGCC TGGTCATCTC TGCCTTGGGT CTTGTCCAAG GGCCATATTG CCGCACCCTT 540 

GATGGCTGGG AGTATGCTTT TGAAGGCACT GCTGGACGTT TCCTTACAGA TTCTAGCATA 600 

TGGATTCAGT GCCTGGAACC TGCACATGTT GTGGAGTGGA ACATCATTTT ATTTTCCATT 660 

CTCATAACCC TCAGTGGGCT TCAAGTGATC ATCTGCCTCA TCAGAGTAGT CATGCAACTA 720 

TCCAAGATAC TGTGTGGAAG CTATTCAGTG ATCTTCCAGC CTGGAATCAT TTGAATAAGG 780 

ACAAAATGTT TTCCATTATC AAGACATGGC CATCTATCTA AATATTATAT CAACTGTGTA 840 

GACTTGAGGG CAATATTGAA ATGATGGTGC TTTCTGCATT TGGTGTTTAT TTGTAAAAAA 900 

TTTGCAGTCC TCACTGCACA TGCAAGTATA CCACCCTTCC ATTTAGTATG TTTTTTAAGT 960 

AATATGCATC AGAAACTTCA GAAATACTTC TGCCCTTTGA TCAAACAAAT CCATTTCCAA 1020 

GAATCTGTAC TAGGGAAGTA AATAAGAATA TGAGAGAAAC CTTTATGCAA ATATGTATAT 1080 

TGCAACATTA TTTAATATTC TGGAAAATTG GAAACACCCC AAAATTCTAA ACTCAGAGGA 1140 

AGGATTAAGT AAAGAGTGGT ACATACTGTA AATGTTTTCT GATATTAAAA AAAAAATTAA 1200 
ATAAAAAATA AAGAGTACTA CATGGTTGTA AAA 

Seq ID NO: 12 Protein sequence; 
Protein Accession #: XP_058189 



1 11 21 31 41 51 

I I I II I 

MGSRKCGGCL SCLLIPLALW SIIVNIIiLYF PNGQTSYASS NKLTNYVWYF EGICFSGIMM 60 
LIVTTVLLVIi ENNNNYKCCQ SENCSKKYVT LLSIIFSSLG IAFSGYCLVI SALGLVQGPY 120 
CRTLDGWEYA FEGTAGRFLT DSSIWIQCLE PAHWEWNII LFSILITLSG LQVIICLIRV 180 
VMQLSKILCG SYSVIFQPGI I 



Seq ID NO: 13 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005397 

Coding sequence: 251.. 1837 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

AAACGCCGCC CAGGACGCAG CCGCCGCCGC CGCCGCTCCT CTGCCACTGG CTCTGCGCCC 60 

CAGCCCGGCT CTGCTGCAGC GGCAGGGAGG AAGAGCCGCC GCAGCGCGAC TCGGGAGCCC 120 

CGGGCCACAG CCTGGCCTCC GGAGCCACCC ACAGGCCTCC CCGGGCGGCG CCCACGCTCC 180 

TACCGCCCGG ACGCGCGGAT CCTCCGCCGG CACCGCAGCC ACCTGCTCCC GGCCCAGAGG 240 

CGAOGACACG ATG CGCTGCG CGCTGGCGCT CTCGGCGCTG CTGCTACTGT TGTCAACGCC 300 

GCCGCTGCTG CCGTCGTCGC CGTCGCCGTC GCCGTCGCCG TCGCCCTCCC AGAATGCAAC 360 

CCAGACTACT ACGGACTCAT CTAACAAAAC AGCACCGACT CCAGCATCCA GTGTCACCAT 420 

CATGGCTACA GATACAGCCC AGCAGAGCAC AGTCCCCACT TCCAAGGCCA ACGAAATCTT 480 

GGCCTCGGTC AAGGCGACCA CCCTTGGTGT ATCCAGTGAC TCACCGGGGA CTACAACCCT 540 

GGCTCAGCAA GTCTCAGGCC CAGTCAACAC TACCGTGGCT AGAGGAGGCG GCTCAGGCAA 600 

CCCTACTACC ACCATCGAGA GCCCCAAGAG CACAAAAAGT GCAGACACCA CTACAGTTGC 660 

AACCTCCACA GCCACAGCTA AACCTAACAC CACAAGCAGC CAGAATGGAG CAGAAGATAC 720 

AACAAACTCT GGGGGGAAAA GCAGCCACAG TGTGACCACA GACCTCACAT CCACTAAGGC 780 

AGAACATCTG ACGACCCCTC ACCCTACAAG . TCCACTTAGC CCCCGACAAC CCACTTTGAC 840 

GCATCCTGTG GCCACCCCAA CAAGCTCGGG ACATGACCAT CTTATGAAAA TTTCAAGCAG 900 

TTCAAGCACT GTGGCTATCC CTGGCTACAC ' CTTCACAAGC CCGGGGATGA CCACCACCCT 960 

ACCGTCATCG GTTATCTCGC AAAGAACTCA ACAGACCTCC AGTCAGATGC CAGCCAGCTC 1020 

TACGGCCCCT TCCTCCCAGG AGACAGTGCA GCCCACGAGC CCGGCAACGG CATTGAGAAC 1080 

ACCTACCCTG CCAGAGACCA TGAGCTCCAG CCCCACAGCA GCATCAACTA CCCACCGATA 1140 

CCCCAAAACA CCTTCTCCCA CTGTGGCTCA TGAGAGTAAC TGGGCAAAGT GTGAGGATCT 1200 

TGAGACACAG ACACAGAGTG AGAAGCAGCT CGTCCTGAAC CTCACAGGAA ACACCCTCTG 1260 

TGCAGGGGGC GCTTCGGATG AGAAATTGAT CTCACTGATA TGCCGAGCAG TCAAAGCCAC 1320 

CTTCAACCCG GCCCAAGATA AGTGCGGCAT ACGGCTGGCA TCTGTTCCAG GAAGTCAGAC 1380 

CGTGGTCGTC AAAGAAATCA CTATTCACAC TAAGCTCCCT GCCAAGGATG TGTACGAGCG 1440 

GCTGAAGGAC AAATGGGATG AACTAAAGGA GGCAGGGGTC AGTGACATGA AGCTAGGGGA 1500 

CCAGGGGCCA CCGGAGGAGG CCGAGGACCG CTTCAGCATG COCCTCATCA TCACCATCGT 1560 

CTGCATGGCG TCATTCCTGC TCCTCGTGGC GGCCCTCTAT GGCTGCTGCC ACCAGCGCCT 1620 

CTCCCAGAGG AAGGACCAGC AGCGGCTAAC AGAGGAGCTG CAGACAGTGG AGAATGGTTA 1680 

CCATGACAAC CCAACACTGG AAGTGATGGA GACCTCTTCT GAGATGCAGG AGAAGAAGGT 1740 

GGTCAGCCTC AACGGGGAGC TGGGGGACAG CTGGATCGTC CCTCTGGACA ACCTGACCAA 1800 

GGACGACCTG GATGAGGAGG AAGACACACA CCTCTAGTCC GGTCTGCCGG TGGCCTCCAG I860 

CAGCACCACA GAGCTCCAGA CCAACCACCC CAAGTGCCGT TTGGATGGGG AAGGGAAAGA 1920 

CTGGGGAGGG AGAGTGAACT CCGAGGGGTG TCCCCTCCCA ATCCCCCCAG GGCCTTAATT 1980 

TTTCCCTTTT CAACCTGAAC AAATCACATT CTGTCCAGAT TCCTCTTGTA AAATAACCCA 2040 

CTAGTGCCTG AGCTCAGTGC TGCTGGATGA TGAGGGAGAT CAAGAAAAAG CCACGTAAGG 2100 

GACTTTATAG ATGAACTAGT GGAATCCCTT CATTCTGCAG TGAGATTGCC GAGACCTGAA 2160 

GAGGGTAAGT GACTTGCCCA AGGTCAGAGC CACTTGGTGA CAGAGCCAGG ATGAGAACAA 2220 
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AGATTCCATT TGCACCATGC CACACTGCTG TGTTCACATG TGCCTTCCGT CCAGAGCAGT 2280 

CCCGGGCAGG GGTGAAACTC CAGCAGGTGG CTGGGCTGGA AAGGAGGGCA GGGCTACATC 2340 

CTGGCTCGGT GGGATCTGAC GACCTGAAAG TCCAGCTCCC AAGTTTTCCT TCTCCTACCC 2400 

CAGCCTCGTG TACCCATCTT CCCACCCTCT ATGTTCTTAC CCCTCCCTAC ACTCAGTGTT 2460 

TGTTCCCACT TACTCTGTCC TGGGGCCTCT GGGATTAGCA CAGGTTATTC ATAACCTTGA 2520 

ACCCCTTGTT CTGGATTCGG ATTTTCTCAC ATTTGCTTCG TGAGATGGGG GCTTAACCCA 2580 

CACAGGTCTC CGTGCGTGAA CCAGGTCTGC TTAGGGGACC TGCGTGCAGG TGAGGAGAGA 2640 

AGGGGACACT CGAGTCCAGG CTGGTATCTC AGGGCAGCTG ATGAGGGGTC AGCAGGAACA 2700 

CTGGCCCATT GCCCCTGGCA CTCCTTGCAG AGGCCACCCA CGATCTTCTT TGGGCTTCCA 2760 

TTTCCACCAG GGACTAAAAT CTGCTGTAGC TAGTGAGAGC AGCGTGTTCC TTTTGTTGTT 2820 

CACTGCTCAG CTGATGGGAG TGATTCCCTG AGACCCAGTA TGAAAGAGCA GTGGCTGCAG 2880 

GAGAGGCCTT CCCGGGGCCC CCCATCAGCG ATGTGTCTTC AGAGACAATC CATTAAAGCA 2940 

GCCAGGAAGG ACAGGCTTTC CCCTGTATAT CATAGGAAAC TCAGGGACAT TTCAAGTTGC 3000 

TGAGAGTTTT GTTATAGTTG TTTTCTAACC CAGCCCTCCA CTGCCAAAGG CCAAAAGCTC 3060 

AGACAGTTGG CAGACGTCCA GTTAGCTCAT CTCACTCACT CTGATTCTCC TGTGCCACAG 3120 

GAAAAGAGGG CCTGGAAAGC GCAGTGCATG CTGGGTGCAT GAAGGGCAGC CTGGGGGACA 3180 

GACTGTTGTG GGAACGTCCC ACTGTCCTGG CCTGGAGCTA GGCCTTGCTG TTCCTCTTCT 3240 

CTGTGAGCCT AGTGGGGCTG CTGCGGTTCT CTTGCAGTTT CTGGTGGCAT CTCAGGGGAA 3300 

CACAAAAGCT ATGTCTATTC CCCAATATAG GACTTTTATG GGCTCGGCAG TTAGCTGCCA 3360 

TGTAGAAGGC TCCTAAGCAG TGGGCATGGT GAGGTTTCAT CTGATTGAGA AGGGGGAATC 3420 

CTGTGTGGAA TGTTGAACTT TCGCCATGGT CTCCATCGTT CTGGGCGTAA ATTCCCTGGG 3480* 

ATCAAGTAGG AAAATGGGCA GAACTGCTTA GGGGAATGAA ATTGCCATTT TTCGGGTGAA 3540 

ACGCCACACC TCCAGGGTCT TAAGAGTCAG GCTCCGGCTG TAGTAGCTCT GATGAAATAG 3600 

GCTATCCACT CGGGATGGCT TACTTTTTAA AAGGGTAGGG GGAGGGGCTG GGGAAGATCT 3660 

GTCCTGCACC ATCTGCCTAA TTCCTTCCTC ACAGTCTGTA GCCATCTGAT ATCCTAGGGG 3720 

GAAAAGGAAG GCCAGGGGTT CACATAGGGC CCCAGCGAGT TTCCCAGGAG TTAGAGGGAT 3780 

GCGAGGCTAA CAAGTTCCAA AAACATCTGC CCCGATGCTC TAGTGTTTGG AGGTGGGCAG 3840 

GATGGAGAAC AGTGCCTGTT TGGGGGAAAA CAGGAAATCT TGTTAGGCTT GAGTGAGGTG 3900 

TTTGCTTCCT TCTTGCCCAG CGCTGGGTTC TCTCCACCCA GTAGGTTTTC TGTTGTGGTC 3960 

CCGTGGGAGA GGCCAGACTG GATTATTCCT CCTTTGCTGA TCCTGGGTCA CACTTCACCA 4020 

GCCAGGGCTT TTGACGGAGA CAGCAAATAG GCCTCTGCAA ATCAATCAAA GGCTGCAACC 4 080 

CTATGGCCTC TTGGAGACAG ATGATGACTG GCAAGGACTA GAGAGCAGGA GTGCCTGGCC 4140 

AGGTCGGTCC TGACTCTCCT GACTCTCCAT CGCTCTGTCC AAGGAGAACC CGGAGAGGCT 4200 

CTGGGCTGAT TCAGAGGTTA CTGCTTTATA TTCGTCCAAA CTGTGTTAGT CTAGGCTTAG 4260 

GACAGCTTCA GAATCTGACA CCTTGCCTTG CTCTTGCCAC CAGGACACCT ATGTCAACAG 4320 

GCCAAACAGC CATGCATCTA TAAAGGTCAT CATCTTCTGC CACCTTTACT GGGTTCTAAA 4380 

TGCTCTCTGA TAATTCAGAG AGCATTGGGT CTGGGAAGAG GTAAGAGGAA CACTAGAAGC 4440 

TCAGCATGAC TTAAACAGGT TGTAGCAAAG ACAGTTTATC ATCAACTCTT TCAGTGGTAA 4500 

ACTGTGGTTT CCCCAAGCTG CACAGGAGGC CAGAAACCAC AAGTATGATG ACTAGGAAGC 4560 

CTACTGTCAT GAGAGTGGGG AGACAGGCAG CAAAGCTTAT GAAGGAGGTA CAGAATATTC 4620 

TTTGCGTTGT AAGACAGAAT ACGGGTTTAA TCTAGTCTAG GCRCCAGATT TTTTTCCCGC 4680 

TTGATAAGGA AAGCTAGCAG AAAGTTTATT TAAACCACTT CTTGAGCTTT ATCTTTTTTG 4740 

ACAATATACT GGAGAAACTT TGAAGAACAA GTTCAAACTG ATACATATAC ACATATTTTT 4800 

TTGATAATGT AAATACAGTG ACCATGTTAA CCTACCCTGC ACTGCTTTAA GTGAACATAC 4860 

TTTGAAAAAG CATTATGTTA GCTGAGTGAT GGCCAAGTTT TTTCTCTGGA CAGGAATGTA 4920 

AATGTCTTAC TGGAAATGAC AAGTTTTTGC TTGATTTTTT TTTTTAAACA AAAAATGAAA 4980 

TATAACAAGA CAAACTTATG ATAAAGTATT TGTCTTGTAG ATCAGGTGTT TTGTTTTGTT 5040 

TTTTTAATTT TAAAATGCAA CCCTGCCCCC TCCCCAGCAA AGTCACAGCT CCATTTCAGT 5100 

AAAGGTTGGA GTCAATATGC TCTGGTTGGC AGGCAACCCT GTAGTCATGG AGAAAGGTAT 5160 

TTCAAGATCT AGTCCAATCT TTTTCTAGAG AAAAAGATAA TCTGAAGCTC ACAAAGATGA 5220 

AGTGACTTCC TCAAAATCAC ATGGTTCAGG ACAGAAACAA GATTAAAACC TGGATCCACA 5280 

GACTGTGCGC CTCAGAAGGA ATAATCGGTA AATTAAGAAT TGCTACTCGA AGGTGCCAGA 5340 

ATGACACAAA GGACAGAATT CCTTTCCCAG TTGTTACCCT AGCAAGGCTA GGGAGGGCAT 5400 

GAACACAAAC ATAAGAACTG GTCTTCTCAC ACTTTCTCTG AATCATTTAG GTTTAAGATG 5460 

TAAGTGAACA ATTCTTTCTT TCTGCCAAGA AACAAAGTTT TGGATGAGCT TTTATATATG 5520 

GAACTTACTC CAACAGGACT GAGGGACCAA GGAAACATGA TGGGGGAGGC AAGAGAGGGC 5580 

AAAGAGTAAA ACTGTAGCAT AGCTTTTGTC ACGGTCACTA GCTGATCCCT CAGGTCTGCT 5640 

GCAAACACAG CATGGAGGAC ACAGATGACT CTTTGGTGTT GGTCTTTTTG TCTGCAGTGA 5700 

" ATGTTCAACA GTTTGCCCAG GAACTGGGGG ATCATATATG TCTTAGTGGA CAGGGGTCTG 5760 

AAGTACACTG GAATTTACTG AGAAACTTGT TTGTAAAAAC TATAGTTAAT AATTATTGCA 5820 
TTTTCTTACA AAAATATATT TTGGAAAATT GTATACTGTC AATTAAAGT 

Seq ID NO: 14 Protein sequence: 
Protein Accession #: NP_005388 

1 11 21 31 41 51 

I I I I I I 

MRCALALSAL LLLLSTPPLL PSSPSPSPSP SPSQNATQTT TDSSNKTAPT PASSVTIMAT 60 

DTAQQSTVPT SKANEILASV KATTLGVSSD SPGTTTLAQQ VSGPVNTTVA RGGGSGNPTT 120 

TIESPKSTKS ADTTTVATST ATAKPNTTSS QNGAEDTTNS GGKSSHSVTT DLTSTKAEHL 180 

TTPHPTSPLS PRQPTLTHPV ATPTSSGHDH LMKISSSSST VAI PGYTFTS PGMTTTIiPSS 240 

VISQRTQQTS SQMPASSTAP SSQETVQPTS PATALRTPTL PETMSSSPTA ASTTHRYPKT 300 

PSPTVAHESN WAKCEDLETQ TQSEKQLVLN LTGNTLCAGG ASDEKLISLI CRAVKATFNP 360 

AQDKCGIRLA SVPGSQTVW KEITIHTKLP AKDVYERLKD KWDELKEAGV SDMKLGDQGP 420 

PEEAEDRFSM PLIITIVCMA SFLLLVAALY GCCHQRLiSQR KDQQRLTEEL QTVENGYHDN 480 
PTLEVMETSS EMQEKKWSL NGELGDSWIV PLDNLTKDDL DEEEDTHL 



185 



WO 02/079492 



PCT7US02/04915 



Seq ID NO: 15 Nucleotide sequence; 
Nucleic Acid Accession #: NMJ>04105 

Coding sequence: 150.. 1631 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CTAGTATTCT ACTAGAACTG GAAGATTGCT CTCCGAGTTT TTTTTTTGTT ATTTTGTTAA 60 

AAAATAAAAA GCTTGAGCAG CAATTCATAT TACTGTCACA GGTATTTTTG CTGTGCTGTG 120 

CAAGGTAACT CTGCTAGCTA AGATTCACAA TGTTGAAAGC CCTTTTCCTA ACTATGCTGA 180 

CTCTGGCGCT GGTCAAGTCA CAGGACACCG AAGAAACCAT CACGTACACG CAATGCACTG 240 

ACGGATATGA GTGGGATCCT GTGAGACAGC AATGCAAAGA TATTGATGAA TGTGACATTG 300 

TCCCAGACGC TTGTAAAGGT GGAATGAAGT GTGTCAACCA CTATGGAGGA TACCTCTGCC 360 

TTCCGAAAAC AGCCCAGATT ATTGTCAATA ATGAACAGCC TCAGCAGGAA ACACAACCAG 420 

CAGAAGGAAC CTCAGGGGCA ACCACCGGGG TTGTAGCTGC CAGCAGCATG GCAACCAGTG 480 

GAGTGTTGCC CGGGGGTGGT TTTGTGGCCA GTGCTGCTGC AGTCGCAGGC CCTGAAATGC 540 

AGACTGGCCG AAATAACTTT GTCATCCGGC GGAACCCAGC TGACCCTCAG CGCATTCCCT 600 

CCAACCCTTC CCACCGTATC CAGTGTGCAG CAGGCTACGA GCAAAGTGAA CACAACGTGT 660 

GCCAAGACAT AGACGAGTGC ACTGCAGGGA CGCACAACTG TAGAGCAGAC CAAGTGTGCA 720 

TCAATTTACG GGGATCCTTT GCATGTCAGT GCCCTCCTGG ATATCAGAAG CGAGGGGAGC 780 

AGTGCGTAGA CATAGATGAA TGTACCATCC CTCCATATTG CCACCAAAGA TGCGTGAATA 840 

CACCAGGCTC ATTTTATTGC CAGTGCAGTC CTGGGTTTCA ATTGGCAGCA AACAACTATA 900 

CCTGCGTAGA TATAAATGAA TGTGATGCCA GCAATCAATG TGCTCAGCAG TGCTACAACA 960 

TTCTTGGTTC ATTCATCTGT CAGTGCAATC AAGGATATGA GCTAAGCAGT GACAGGCTCA 1020 

ACTGTGAAGA CATTGATGAA TGCAGAACCT CAAGCTACCT GTGTCAATAT CAATGTGTCA 1080 

ATGAACCTGG GAAATTCTCA TGTATGTGCC CCCAGGGATA CCAAGTGGTG AGAAGTAGAA 1140 

CATGTCAAGA TATAAATGAG TGTGAGACCA CAAATGAATG CCGGGAGGAT GAAATGTGTT 1200 

GGAATTATCA TGGCGGCTTC CGTTGTTATC CACGAAATCC TTGTCAAGAT CCCTACATTC 1260 

TAACACCAGA GAACCGATGT GTTTGCCCAG TCTCAAATGC CATGTGCCGA GAACTGCCCC 1320 

AGTCAATAGT CTACAAATAC ATGAGCATCC GATCTGATAG GTCTGTGCCA TCAGACATCT 1380 

TCCAGATACA GGCCACAACT ATTTATGCCA ACACCATCAA TACTTTTCGG ATTAAATCTG 144 0 

GAAATGAAAA TGGAGAGTTC TACCTACGAC AAACAAGTCC TGTAAGTGCA ATGCTTGTGC 1500 

TCGTGAAGTC ATTATCAGGA CCAAGAGAAC ATATCGTGGA CCTGGAGATG CTGACAGTCA 1560 

GCAGTATAGG GACCTTCCGC ACAAGCTCTG TGTTAAGATT GACAATAATA GTGGGGCCAT 1620 

TTTCATTTTA. GTCTTTTCTA AGAGTCAACC ACAGGCATTT AAGTCAGCCA AAGAATATTG 1680 

TTACCTTAAA GCACTATTTT ATTTATAGAT ATATCTAGTG CATCTACATC TCTATACTGT 1740 

ACACTCACCC ATAACAAACA ATTACACCAT GGTATAAAGT GGGCATTTAA TATGTAAAGA 1800 

TTCAAAGTTT GTCTTTATTA CTATATGTAA ATTAGACATT AATCCACTAA ACTGGTCTTC 1860 

TTCAAGAGAG CTAAGTATAC ACTATCTGGT GAAACTTGGA TTCTTTCCTA TAAAAGTGGG 1920 

ACCAAGCAAT GATGATCTTC TGTGGTGCTT AAGGAAACTT ACTAGAGCTC CACTAACAGT 1980 

CTCATAAGGA GGCAGCCATC ATAACCATTG AATAGCATGC AAGGGTAAGA ATGAGTTTTT 2040 

AACTGCTTTG TAAGAAAATG GAAAAGGTCA ATAAAGATAT ATTTCTTTAG AAAATGGGGA 2100 

TCTGCCATAT TTGTGTTGGT TTTTATTTTC ATATCCAGCC TAAAGGTGGT TGTTTATTAT 2160 

ATAGTAATAA ATCATTGCTG TACAACATGC TGGTTTCTGT AGGGTATTTT TAATTTTGTC 2220 

AGAAATTTTA GATTGTGAAT ATTTTGTAAA AAACAGTAAG CAAAATTTTC CAGAATTCCC 2280 

AAAATGAACC AGATACCCCC TAGAAAATTA TACTATTGAG AAATCTATGG GGAGGATATG 2340 

AGAAAATAAA TTCCTTCTAA ACCACATTGG AACTGACCTG AAGAAGCAAA CTCGGAAAAT 2400 

ATAATAACAT CCCTGAATTC AGGCATTCAC AAGATGCAGA ACAAAATGGA TAAAAGGTAT 2460 

TTCACTGGAG AAGTTTTAAT TTCTAAGTAA AATTTAAATC CTAACACTTC ACTAATTTAT .2520 

AACTAAAATT TCTCATCTTC GTACTTGATG CTCACAGAGG AAGAAAATGA TGATGGTTTT 2580 

TATTCCTGGC ATCCAGAGTG ACAGTGAACT TAAGCAAATT ACCCTCCTAC CCAATTCTAT 2640 

GGAATATTTT ATACGTCTCC TTGTTTAAAA TCTGACTGCT TTACTTTGAT GTATCATATT 2700 
TTTAAATAAA AATAAATATT CCTTTAGAAG ATCACTCTAA AA 



Seq ID NO: 16 Protein sequence.! 
Protein Accession #: NP_004096 



1 ' 11 21 31 41 51 
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MLKALFLTML TLALVKSQDT EETITYTQCT DGYEWDPVRQ QCKDIDECDI VPDACKGGMK 60 
CVNHYGGYLC LPKTAQIIVN NEQPQQETQP AEGTSGATTG WAASSMATS GVLPGGGFVA 120 
SAAAVAGPEM QTGRNNFVIR RNPADPQRIP SNPSHRIQCA AGYEQSEHNV CQDIDECTAG "180 
THNCRADQVC INLRGSFACQ CPPGYQKRGE QCVDIDECTI PPYCHQRCVN TPGSFYCQCS 240 
PGFQLAANNY TCVDINECDA SNQCAQQCYN ILGSFICQCN QGYELSSDRL NCEDIDECRT 300 
SSYLCQYQCV NEPGKFSCMC PQGYQWRSR TCQDINECET TNECREDEMC WNYHGGFRCY 360 
PRNPCQDPYI LTPENRCVCP VSNAMCRELP QSIVYKYMSI RSDRSVPSDI FQIQATTIYA 420 
NT I NT FR IKS GNENGEFYLR QTSPVSAKLV LVKSLSGPRE HIVDLEMLTV SSIGTFRTSS 480 
VLRLTIIVGP FSF 

Seq ID NO: 17 Nucleotide sequence: 
Nucleic Acid Accession #: NM_018894 

Coding sequence: 27.. 1967 (underlined sequences correspond to start and stop codons) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



i 
I 

AAAACATTCA 
AGATAAAACC 
AAAGACACAT 
TGAGTGGGAT 
CGCTTGTAAA 
AACAGCCCAG 
AACCTCAGGG 
GCCCGGGGGT 
CCGAAATAAC 
TTCCCACCGT 
CATAGACGAG 
ACGGGGATCC 
AGACATAGAT 
CTCATTTTAT 
AGATATAAAT 
TTCATTCATC 
AGACATTGAT 
TGGGAAATTC 
AGATATAAAT 
TCATGGCGGC 
AGAGAACCGA 
AGTCTACAAA 
ACAGGCCACA 
AAATGGAGAG 
GTCATTATCA 
AGGGACCTTC 
TTAGTCTTTT 
AAAGCACTAT 
CCCATAACAA 
TTTGTCTTTA 
GAGCTAAGTA 
AATGATGATC 
GGAGGCAGCC 
TTGTAAGAAA 
TATTTGTGTT 
TAAATCATTG 
TTAGATTGTG 
ACCAGATACC 
AAATTCCTTC 
CATCCCTGAA 
GAGAAGTTTT 
ATTTCTCATC 
GGCATCCAGA 
TTTATACGTC 
AAAAATAAAT 



11 
I 

ACAAATTAAT 
AATAGAGTGC 
GGCCTTCTTT 
CCTGTGAGAC 
GGTGGAATGA 
ATTATTGTCA 
GCAACCACCG 
GGTTTTGTGG 
TTTGTCATCC 
ATCCAGTGTG 
TGCACTGCAG 
TTTGCATGTC 
GAATGTACCA 
TGCCAGTGCA 
GAATGTGATG 
TGTCAGTGCA 
GAATGCAGAA 
TCATGTATGT 
GAGTGTGAGA 
TTCCGTTGTT 
TGTGTTTGCC 
TACATGAGCA 
ACTATTTATG 
TTCTACCTAC 
GGACCAAGAG 
CGCACAAGCT 
CTAAGAGTCA 
TTTATTTATA 
ACAATTACAC 
TTACTATATG 
TACACTATCT 
TTCTGTGGTG 
ATCATAACCA 
ATGGAAAAGG 
GGTTTTTATT 
CTGTACAACA 
AATATTTTGT 
CCCTAGAAAA 
TAAACCACAT 
TTCAGGCATT 
AATTTCTAAG 
TTCGTACTTG 
GTGACAGTGA 
TCCTTGTTTA 
ATTCCTTTAG 



21 
I 

GGGTGTAAGG 
AGAATAAGAC 
GTGTACATGA 
AGCAATGCAA 
AGTGTGTCAA 
ATAATGAACA 
GGGTTGTAGC 
CCAGTGCTGC 
GGCGGAACCC 
CAGCAGGCTA 
GGACGCACAA 
AGTGCCCTCC 
TCCCTCCATA 
GTCCTGGGTT 
CCAGCAATCA 
ATCAAGGATA 
CCTCAAGCTA 
GCCCCCAGGG 
CCACAAATGA 
ATCCACGAAA 
CAGTCTCAAA 
TCCGATCTGA 
CCAACACCAT 
GACAAACAAG 
AACATATCGT 
CTGTGTTAAG 
ACCACAGGCA 
GATATATCTA 
CATGGTATAA 
TAAATTAGAC 
GGTGAAACTT 
CTTAAGGAAA 
TTGAATAGCA 
TCAATAAAGA 
TTCATATCCA 
TGCTGGTTTC 
AAAAAACAGT 
TTATACTATT 
TGGAACTGAC 
CACAAGATGC 
TAAAATTTAA 
ATGCTCACAG 
ACTTAAGCAA 
AAATCTGACT 
AAGATCACTC 



31 
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AACTGGAAAA 
TCAAGTCAAG 
CATGCATTCT 
AGATATTGAT 
CCACTATGGA 
GCCTCAGCAG 
TGCCAGCAGC 
TGCAGTCGCA 
AGCTGACCCT 
CGAGCAAAGT 
CTGTAGAGCA 
TGGATATCAG 
TTGCCACCAA 
TCAATTGGCA 
ATGTGCTCAG 
TGAGCTAAGC 
CCTGTGTCAA 
ATACCAAGTG 
ATGCCGGGAG 
TCCTTGTCAA 
TGCCATGTGC 
TAGGTCTGTG 
CAATACTTTT 
TCCTGTAAGT 
GGACCTGGAG 
ATTGACAATA 
TTTAAGTCAG 
GTGCATCTAC 
AGTGGGCATT 
ATTAATCCAC 
GGATTCTTTC 
CTTACTAGAG 
TGCAAGGGTA 
TATATTTCTT 
GCCTAAAGGT 
TGTAGGGTAT 
AAGCAAAATT 
GAGAAATCTA 
CTGAAGAAGC 
AGAACAAAAT 
ATCCTAACAC 
AGGAAGAAAA 
ATTACCCTCC 
GCTTTACTTT 
TAAAA 



41 
I 

CCTGGACTCC 
TAAGTAACGT 
CAACAATGCA 
GAATGTGACA 
GGATACCTCT 
GAAACACAAC 
ATGGCAACCA 
GGCCCTGAAA 
CAGCGCATTC 
GAACACAACG 
GACCAAGTGT 
AAGCGAGGGG 
AGATGCGTGA 
GCAAACAACT 
CAGTGCTACA 
AGTGACAGGC 
TATCAATGTG 
GTGAGAAGTA 
GATGAAATGT 
GATCCCTACA 
CGAGAACTGC 
CCATCAGACA 
CGGATTAAAT 
GCAATGCTTG 
ATGCTGACAG 
ATAGTGGGGC 
CCAAAGAATA 
ATCTCTATAC 
TAATATGTAA 
TAAACTGGTC 
CTATAAAAGT 
CTCCACTAAC 
AGAATGAGTT 
TAGAAAATGG 
GGTTGTTTAT 
TTTTAATTTT 
TTCCAGAATT 
TGGGGAGGAT 
AAACTCGGAA 
GGATAAAAGG 
TTCACTAATT 
TGATGATGGT 
TACCCAATTC 
GATGTATCAT 



51 
I 

TACCACATGC 
TAAACACCAT 
CTGACGGATA 
TTGTCCCAGA 
GCCTTCCGAA 
CAGCAGAAGG 
GTGGAGTGTT 
TGCAGACTGG 
CCTCCAACCC 
TGTGCCAAGA 
GCATCAATTT 
AGCAGTGCGT 
ATACACCAGG 
ATACCTGCGT 
ACATTCTTGG 
TCAACTGTGA 
TCAATGAACC 
GAACATGTCA 
GTTGGAATTA 
TTCTAACACC 
CCCAGTCAAT 
TCTTCCAGAT 
CTGGAAATGA 
TGCTCGTGAA 
TCAGCAGTAT 
CATTTTCATT 
TTGTTACCTT 
TGTACACTCA 
AGATTCAAAG 
TTCTTCAAGA 
GGGACCAAGC 
AGTCTCATAA 
TTTAACTGCT 
GGATCTGCCA 
TATATAGTAA 
GTCAGAAATT 
CCCAAAATGA 
ATGAGAAAAT 
AATATAATAA 
TATTTCACTG 
TATAACTAAA 
TTTTATTCCT 
TATGGAATAT 
ATTTTTAAAT 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 



Seq ID NO: 18 Protein sequence: 
Protein Accession #: NP_061489.1 



55 



60 



65 



I 

MHSQQCTDGY 
PQQETQPAEG 
ADPQRIPSNP 
GYQKRGEQCV 
CAQQCYNILG 
YQWRSRTCQ 
AMCRELPQSI 
PVSAMLVLVK 



11 
I 

EWDPVRQQCK 
TSGATTGWA 
SHRIQCAAGY 
DIDECTIPPY 
SFICQCNQGY 
DINECETTNE 
VYKYMSIRSD 
SLSGPREHIV 



21 
I 

DIDECDIVPD 
ASSMATSGVL 
EQSEHNVCQD 
CHQRCVNTPG 
ELSSDRLNCE 
CREDEMCWNY 
RSVPSDIFQI 
DLEMLTVSSI 



31 

I • 

ACKGGMKCVN 
PGGGFVASAA 
IDECTAGTHN 
SFYCQCSPGF 
DIDECRTSSY 
HGGFRCYPRN 
QATTIYANTI 
GTFRTSSVLR 



41 

I 

HYGGYLCLPK 
AVAGPEMQTG 
CRADQVCINL 
QLAANNYTCV 
LCQYQCVNEP 
PCQDPYILTP 
NTFRIKSGNE 
LTIIVGPFSF 



51 
I 

TAQIIVNNEQ 
RNNFVIRRNP 
RGSFACQCPP 
DINECDASNQ 
GKFSCMCPQG 
ENRCVCPVSN 
NGEFYLRQTS 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 19 Nucleotide sequence: 
Nucleic Acid Accession #: NM_006500 

Coding sequence: 27.. 1967 (underlined sequences correspond to start and stop codons) 



70 



75 



ACTTGCGTCT 
TCGCCGCCTG 
CGCCTGAGCT 
AGTCCCAAGG 
TCATCTTCCG 
TCAGCCTCCA 



11 
I 

CGCCCTCCGG 
CTGCTGCTGT 
GGTGGAGGTG 
CAACCTCAGC 
TGTGCGCCAG 
GGACAGAGGG 



21 
I 

CCAAGCATGG 
CCTCGCGTCG 
GAAGTGGGCA 
CATGTCGACT 
GGCCAGGGCC 
GCTACTCTGG 



31 

I 

GGCTTCCCAG 
CGGGTGTGCC 
GCACAGCCCT 
GGTTTTCTGT 
AGAGCGAACC 
CCCTGACTCA 



41 
I 

GCTGGTCTGC 
CGGAGAGGCT 
TCTGAAGTGC 
CCACAAGGAG 
TGGGGAGTAC 
AGTCACCCCC 



51 
I 

GCCTTCTTGC 60 

GAGCAGCCTG 120 

GGCCTCTCCC 180 

AAGCGGACGC 240 

GAGCAGCGGC 300 

CAAGACGAGC 360 



187 



WO 02/079492 



PCTYUS02/04915 



GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 4 80 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 

TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

5 CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

TGQTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

10 GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

15 TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

20 TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 174 0 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

25 GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

30 GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

35 GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

40 TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 . 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC " AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

45 GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTG CAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300. 

50 AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 

55 

Seq ID NO: 20 Protein sequence: 
Protein Accession #: NP_006491 

60 

1 11 21 31 41 51 

I I I I I I 

MGLPRLVCAP LLAACCCCPR VAGVPGEAEQ PAPELVEVBV GSTALLKCGL SQSQGNLSHV 60 

DWFSVHKEKR TLIFRVRQGQ GQSEPGEYEQ RLSLQDRGAT LALTQVTPQD ERIFLCQGKR 120 

65 PRSQEYRIQL RVYKAPEEPN IQVNPIiGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

LKEEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVFYPTE KWJXiEVEPVG MLKEGDRVEI RCIADGNPPP HFSISKQNPS TREAEEETTN 300 

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS 360 

LTLTCEAESS QDLEFGWLRE ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGLNRT 420 

70 QLVKLAIFGP PWMAFKERKV WVKENMVLNL SCEASGHPRP TISWNVNGTA SEQDQDPQRV 4 80 

LSTIiNVLVTP ELLETGVECT ASNDLGKNTS ILFLELVNLT TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTER KLPEPESRGV VIVAVIVCIL VLAVLGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTELW EVKSDKLPEE MGLLQGSSGD KRAPGDOGEK YIDLRH 

75 
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Seq ID NO: 21 Nucleotide sequence: 
Nucleic Acid Accession #: NM_002421 

Coding sequence: 72.. 1481 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

GGGATATTGG AGTAGCAAGA GGCTGGGAAG 
CAAAGGCCAG TATGCACAGC TTTCCTCCAC 
CTCACAGCTT CCCAGCGACT CTAGAAACAC 
ACCTGGAAAA ATACTACAAC CTGAAGAATG 
GTGGCCCAGT GGTTGAAAAA TTGAAGCAAA 
GGAAACCAGA TGCTGAAACC CTGAAGGTGA 
TGGCTCAGTT TGTCCTCACT GAGGGGAACC 
GGATTGAAAA TTACACGCCA GATTTGCCAA 
CCTTCCAACT CTGGAGTAAT GTCACACCTC 
CAGACATCAT GATATCTTTT GTCAGGGGAG 
CTGGAGGAAA TCTTGCTCAT GCTTTTCAAC 
TTGATGAAGA TGAAAGGTGG ACCAACAATT 
CTCATGAACT CGGCCATTCT CTTGGACTCT 
ACCCTAGCTA CACCTTCAGT GGTGATGTTC 
AAGCCATATA TGGACGTTCC CAAAATCCTG 
CGTGTGACAG TAAGCTAACC TTTGATGCTA 
TTAAAGACAG ATTCTACATG CGCACAAATC 
TTTCTGTTTT CTGGCCACAA CTGCCAAATG 
GAGATGAAGT CCGGTTTTTC AAAGGGAATA 
TACACGGATA CCCCAAGGAC ATCTACAGCT 
TCGATGCTGC TCTTTCTGAG GAAAACACTG 
ACTGGAGGTA TGATGAATAT AAACGATCTA 
ATGACTTTCC TGGAATTGGC CACAAAGTTG 
ATTTCTTTCA TGGAACAAGA CAATACAAAT 
TCCAGAAAGC TAATAGCTGG TTCAACTGCA 
AAAACACATG GTGTGAGTCC AAAGAAGGTG 
CATTTTTAAC CTCTAGAGTC ACTGATACAC 
CATATTTTTT TACTATTTAG AATGTAGCCC 
ATGGTGGGTA CAAAAAGTCA AGTTTGTGGC 
GATCTTTTCC AGAGTATGCA ACTCTGACGT 
ATATCCTTTC AAGACAGAAA GAGACAGGAG 
CAAGAACACA TGTGCAGTCA CTGGTGTCAC 
ACAAAATAAG TGTTTTATGT TTGGAATAAA 



31 41 51 

I I I 

CCATCACTTA CCTTGCACTG AGAAAGAAGA 60 

TGCTGCTGCT GCTGTTCTGG GGTGTGGTGT 120 

AAGAGCAAGA TGTGGACTTA GTCCAGAAAT 180 

ATGGGAGGCA AGTTGAAAAG CGGAGAAATA 240 

TGCAGGAATT CTTTGGGCTG AAAGTGACTG 300 

TGAAGCAGCC CAGATGTGGA GTGCCTGATG 360 

CTCGCTGGGA GCAAACACAT CTGACCTACA 420 

GAGCAGATGT GGACCATGCC ATTGAG AAAG ' 480 

TGACATTCAC CAAGGTCTCT GAGGGTCAAG 540 

ATCATCGGGA CAACTCTCCT TTTGATGGAC 600 

CAGGCCCAGG TATTGGAGGG GATGCTCATT 660 

TCAGAGAGTA CAACTTACAT CGTGTTGCGG 720 

CCCATTCTAC TGATATCGGG GCTTTGATGT 780 

AGCTAGCTCA GGATGACATT GATGGCATCC 840 

TCCAGCCCAT CGGCCCACAA ACCCCAAAAG 900 

TAACTACGAT TCGGGGAGAA GTGATGTTCT 960 

CCTTCTACCC GGAAGTTGAG CTCAATTTCA 1020 

GGCTTGAAGC TGCTTACGAA TTTGCCGACA 1080 

AGTACTGGGC TGTTCAGGGA CAGAATGTGC 1140 

CCTTTGGCTT CCCTAGAACT GTGAAGCATA 1200 

GAAAAACCTA CTTCTTTGTT GCTAACAAAT 1260 

TGGATCCAGG TTATCCCAAA ATGATAGCAC 1320 

ATGCAGTTTT CATGAAAGAT GGATTTTTCT 1380 

TTGATCCTAA AACGAAGAGA ATTTTGACTC 1440 

GGAAAAATTG AACATTACTA ATTTGAATGG 1500 

TTTTCCTGAA GAACTGTCTA TTTTCTCAGT 1560 

AGAATATAAT CTTATTTATA CCTCAGTTTG 1620 

TTTTTGTACT GATATAATTT AGTTCCACAA 1680 

TTATGGATTC ATATAGGCCA GAGTTGCAAA 1740 

TGATCCCAGA GAGCAGCTTC AGTGACAAAC 1800 

ACATGAGTCT TTGCCGGAGG AAAAGCAGCT 1860 

CCTGGATAGG CAAGGGATAA CTCTTCTAAC 1920 
GTCAACCTTG TTTCTACTGT TTT 



Seq ID NO: 22 Protein sequence: 
Protein Accession #: NP_002412 



1 11 21 31 41 51 

I I I I I I 

MHSFPPLLLL LFWGWSHSP PATLETQEQD VDLVQKYLEK YYNXjKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT* KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHEL GHSLGLSHSt DIGALMYPSY 240 

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 

Seq ID NO: 23 Nucleotide sequence: 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 141-1580 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

TCTGCGTGTG CCGGGGCTAG GGGCTGGAAG TCCTGGCTCT AGTTGCACCT CGGAAGGAAA 60 

AGGCAAACAG AGGAGGGAAG GCGTCTTAGG ACTGCCTGGA TCCAGAGCAC TTTCCTCGGC 120 

CTCTACAGGC CTGTGTCGCT ATGGGTTCCC CCGCCGCCCC GGAGGGAGCG CTGGGCTACG 180 

TCCGCGAGTT CACTCGCCAC TCCTCCGACG TGCTGGGCAA CCTCAACGAG CTGCGCCTGC 240 

GCGGGATCCT CACTGACGTC ACGCTGCTGG TTGGCGGGCA ACCCCTCAGA GCACACAAGG 300 

CAGTTCTCAT CGCCTGCAGT GGCTTCTTCT ATTCAATTTT CCGGGGCCGT GCGGGAGTCG 360 

GGGTGGACGT GCTCTCTCTG CCCGGGGGTC CCGAAGCGAG AGGCTTCGCC CCTCTATTGG 420 

ACTTCATGTA CACTTCGCGC CTGCGCCTCT CTCCAGCCAC TGCACCAGCA GTCCTAGCGG 480 

CCGCCACCTA TTTGCAGATG GAGCACGTGG TCCAGGCATG CCACCGCTTC ATCCAGGCCA 540 

GCTATGAACC TCTGGGCATC TCCCTGCGCC CCCTGGAAGC AGAACCCCCA ACACCCCCAA 600 

CGGCCCCTCC ACCAGGTAGT CCCAGGCGCT CCGAAGGACA CCCAGACCCA CCTACTGAAT 660 
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CTCGAAGCTG CAGTCAAGGC CCCCCCAGTC CAGCCAGCCC TGACCCCAAG GCCTGCAACT 720 

GGAAAAAGTA CAAGTACATC GTGCTAAACT CTCAGGCCTC CCAAGCAGGG AGCCTGGTCG 780 

GGGAGAGAAG TTCTGGTCAA CCTTGCCCCC AAGCCAGGCT CCCCAGTGGA GACGAGGCCT 840 

CCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GTGAAGAAGG ACCCATTCCT GGTCCCCAGA 900 

GCAGGCTCTC TCCAACTGCT GCCACTGTGC AGTTCAAATG TGGGGCTCCA GCCAGTACCC 960 

CCTACCTCCT CACATCCCAG GCTCAAGACA CCTCTGGATC ACCCTCTGAA CGGGCTCGTC 1020 

CACTACCGGG AAGTGAATTT TTCAGCTGCC AGAACTGTGA GGCTGTGGCA GGGTGCTCAT 1080 

CGGGGCTGGA CTCCTTGGTT CCTGGGGACG AAGACAAACC CTATAAGTGT CAGCTGTGCC 1140 

GGTCTTCGTT CCGCTACAAG GGCAACCTTG CCAGTCATCG TACAGTGCAC ACAGGGGAAA 1200 

AGCCTTACCA CTGCTCAATC TGCGGAGCCC GTTTTAACCG GCCAGCAAAC CTGAAAACGC 1260 

ACAGCCGCAT CCATTCGGGA GAGAAGCCGT ATAAGTGTGA GACGTGCGGC TCGCGCTTTG 1320 

TACAGGTGGC ACATCTGCGG GCGCACGTGC TGATCCACAC CGGGGAGAAG CCCTACCCTT 1380 

GCCCTACCTG CGGAACCCGC TTCCGCCACC TGCAGACCCT CAAGAGCCAC GTTCGCATCC 1440 

ACACCGGAGA GAAGCCTTAC CACTGCGACC CCTGTGGCCT GCATTTCCGG CACAAGAGTC 1500 

AACTGCGGCT GCATCTGCGC CAGAAACACG GAGCTGCTAC CAACACCAAA GTGCACTACC 1560 

ACATTCTCGG GGGGCC CTAG CTGAGCGCAG GCCCAGGCCC CACTTGCTTC CTGCGGGTGG 1620 

GAAAGCTGCA GGCCCAGGCC TTGCTTCCCT ATCAGGCTTG GGCATAGGGG TGTGCCAGGC 1680 

CACTTTGGTA TCAGAAATTG CCACCCTCTT AATTTCTCAC TGGGGAGAGC AGGGGTGGCA 1740 

GATCCTGGCT AGATCTGCCT CTGTTTTGCT GGTCAAAACC TCTTCCCCAC AAGCCAGATT 1800 

GTTTCTGAGG AGAGAGCTAG CTAGGGGCTG GGAAAGGGGA GAGATTGGAG TCCTGGTCTC 1860 

CCTAAGGGAA TAGCCCTCCA CCTGTGGCCC CCATTGCATT CAGTTTATCT GTAAATATAA 1920 

TTTATTGAGG CCTTTGGGTG GCACCGGGGC CTTCATTCGA TTGCATTTCC CACTCCCCTC 1980 

TTCCACAAGT GTGATTAAAA GTGACCAGAA ACACAGAAGG TGAGATCACA GCTCTGCTGG 2040 

CAGAGATTAC TAGCCCTTGG CTCTCTCGTT TGGCTTGGGT ATTTTATATT ATTTCTGTCA 2100 

TAACTTTTAT CTTTAGAATT GTTCTTTCTC CTGTTTGTTT GCTTGTTAGT TTGTTTAAAA 2160 

TGGAAAAAGG GGTTCTCTGT GTTCTGCCCC TGTAATTCTA GGTCTGGAAC CTTTATTTGT 2220 

TCTAGGGCAG CTCTGGGAAC ATGCGGGATT GTGGAATFGG GTCAGGAACC CTCTCTGGTA 2280 

TTCTGGATGT TGTAGGTTCT CTAGCAGTCT AGAAATGGAT ACAGACATTT CTCTGTTCTT 2340 

CAAGGGTGAT AGGAACCATT ATGTTGAGCC CAAAATGGAA GTAATAATAA ATGCCTCCTG 2400 

GAGGCTGTGG GTGTGGGGGA TTCTGTATCT GGATTCCGTA TCACTCCAAC TGGAGGCTGT 2460 

GGGTGTGGGG GATTCTGTAT CTGGATTCCG TATCACTCCA AGTGGAGGCT GGCAGGTTTT 2520 

TCTGCAAGAT GGTCCAGAAT CTAAAATGTC CCATTAATCT GGTCACTTGG GTTTGGCTCT 2580 

GCTGTATCCA TCTATAGTGG TAGAGACCCA CCAGGGCTCA AGTGGAGTCC ATCATCCTCC 2640 

CACGGGGGCC TGTTCTTAGC ACTGAGTTGA TCGCTCCATG GGGGAGAGAT CAGACATTCC 2700 

TTATCAGAGA TGATGTGACC TTTTCTGACT CTGCCCAGTC TCTATGAATG TTATGGCCTA 2760 

GGGAAGAATC ATGAAACTCT TTAGCTTGAT TAGATGGTAA ACAGTGTTAA CCCATCCTTT 2820 

ACTACAGAGG CATATGGGTT TGAATGTTAC CTGGGGTTCT CTCTATTGAG TTGAGCCCCT 2880 

TCTTCCTTTA GTGGGTTTTG GACATCTTCT GGCAAGTGTC CAGATGCCAG AACCTTCTTT 2940 

TCCTCTAGAA GGGATGGTGC TTGGTAACCT TACCTTTTAA AAGCTGGGTC TGTGACCTGG 3000 

TCTTCCCATC CCTGCATTCC TGTCTGGAAC CAGTGAATGC ATTAGAACCT TCCATAGGAA 3060 

AAGAAAAGGG GCTGAGTTCC ATTCTGGGTT TGCTGTAGTT TGGTTGGGAT TATTGTTGGC 3120 

ATTACAGATG TAAAAGATTG ACTAGCCCAT AGGCCAAAGG CCTGTTCTAG TTGACCAAGT 3180 

TTCAAGTAGG ATTAAGAGGT TGGTTGAGGG GTGCAGTTTC TGGTGTAGGC CAGGTAGGTA 3240 

GAAAGTGAGG AACAGGGTTG CCTCTTGGCT GGGTGGAGTC TCTGAAATGT TAGAAGAAGC 3300 

GCTGAAGCCT TGATTGATAG TTCTGCCCCT TGTTGCCCTG GGGCTTATCT GATTATGGGA 3360 

CGAGGGTAGA AAGTAAGAAG CACTTTTGAA TTTGTGGGGT AGAACTTCAA CAATAAGTCA 3420 

GTTCTAGTGG CTGTCGCCTG GGGACTAGTG AGAAAGCTAC TCTTCTCCCT CTTCCCTCTT 3480 

TCTCCCCATG GCCCCACTGC AGAATTAAAG AAGGAAGAAG GGAAGGCGGA GGAGTCTATA 3540 

AGAAGGAATC ATGATTTCTA TTTAGCAGAT TGGATGGGCA GGTGGAGAAT GCCTGGGGGT 3600 

AGAAATGTTA GATCTTGCAA CATCAGATCC TTGGAATAAA GAAGCCTCTC TGYGCWRAAA 3660 
AAAAAAAAAA AAAAAA 



Seq ID NO: 24 Protein sequence: 
Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

I I I I I I 

MGSPAAPEGA LGYVREFTRH SSDVLGNLNE LRLRGILTDV TLLVGGQPLR AHKAVLIACS 60 

GFFYSIFRGR AGVGVDVLSL PGGPEARGFA PLLDPMYTSR LRLSPATAPA VLAAATYLQM 120 

EHWQACHRF IQASYEPLGI SLRPLEAEPP TPPTAPPPGS PRRSEGHPDP PTESRSCSQG 180 

PPSPASPDPK ACNWKKYKYI VLNSQASQAG SLVGERSSGQ PCPQARLPSG DEASSSSSSS 240 

SSSSEEGPIP GPQSRIiSPTA ATVQFKCGAP ASTPYLLTSQ AQDTSGSPSE RARPLPGSEP 300 

FSCQNCEAVA GCSSGLDSLV PGDEDKPYKC QLCRSSFRYK GNLASHRTVH TGEKPYHCSI 360 

CGARFNRPAN LKTHSRIHSG EKPYKCETCG SRFVQYAHLR AHVLIHTGEK PYPCPTCGTR 420 
FRHLQTLKSH VRIHTGEKPY HCDPCGLHFR HKSQLRLHLR QKHGAATNTK VHYHILGGP 



Seq ID NO: 25 Nucleotide sequence: 
Nucleic Acid Accession #: U21551 

Coding sequence: 1..1155 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

I I I I I I 

ATGG ATTQCA GTAACGGATC GGCAGAGTGT ACCGGAGAAG GAGGATCAAA AGAGGTGGTG 60 
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GGGACTTTTA AGGCTAAAGA CCTAATAGTC 
GACCCCAATA ATCTGGTTTT TGGAACTGTG 
TCCTCAGAGT TTGGATGGGA GAAACCTCAT 
CCTGGCTCAT CAGCTTTGCA CTATGCAGTG 
GGAGTAGATA ATAAAATTCG ACTGTTTCAG 
TCTGCTGTGA GGGCAACTCT GCCGGTATTT 
CAGCTTGTGA AATTGGATCA AGAATGGGTC 
CGTCCTGCAT TCATTGGAAC TGAGCCTTCT 
CTCTTTGTAC TCT7GAGCCC AGTGGGACCT 
TCCCTGTGGG CCAATCCCAA GTATGTAAGA 
ATGGGAGGGA ATTACGGCTC ATCTCTTTTT 
CAGCAGGTCC TGTGGCTCTA TGGCAGAGAC 
CTTTTTCTTT ACTGGATAAA TGAAGATGGA 
GGCATCATTC TTCCAGGAGT GACAAGGCGG 
GAATTTAAGG TGTCAGAGAG ATACCTCACC 
AACAGAGTGA GAGAGATGTT TAGCTCTGGT 
ATACTGTACA AAGGCGAGAC AATACACATT 
AGCCGCATCT TGAGCAAATT AACTGATATC 
ATTGTGCTAT CC TGA 



ACACCAGCTA CCATTTTAAA GGAAAAACCA 120 

TTCACGGATC ATATGCTGAC GGTGGAGTGG 180 

ATCAAGCCTC TTCAGAACCT GTCATTGCAC 240 

GAATTATTTG AAGGATTGAA GGCATTTCGA 3 00 

CCAAACCTCA ACATGGATAG AATGTATCGC 360 

GACAAAGAAG AGCTCTTAGA GTGTATTCAA 420 

CCATATTCAA CATCTGCTAG TCTGTATATT 4 80 

CTTGGAGTCA AGAAGCCTAC CAAAGCCCTG 540 

TATTTTTCAA GTGGAACCTT TAATCCAGTG 600 

GCCTGGAAAG GTGGAACTGG GGACTGCAAG 660 

GCCCAATGTG AAGACGTAGA TAATGGGTGT 720 

CATCAGATCA CTGAAGTGGG AACTATGAAT 780 

GAAGAAGAAC TGGCAACTCC TCCACTAGAT 840 

TGCATTCTGG ACCTGGCACA TCAGTGGGGT 900 

ATGGATGACT TGACAACAGC CCTGGAGGGG 960 

ACAGCCTGTG TTGTTTGCCC AGTTTCTGAT 1020 

CCAACTATGG AGAATGGTCC TAAGCTGGCA 1080 

CAGTATGGAA GAGAAGAGAG CGACTGGACA 1140 



Seq ID NO: 26 Protein sequence: 
Protein Accession #: AAB08528 



1 11 21 31 41 51 

I I I I I I 

MDCSNGSAEC TGEGGSKEW GTFKAKDLIV TPATILKEKP DPNNLVFGTV FTDHMLTVEW 60 

SSEFGWEKPH IKPLQNLSLH PGSSALHYAV ELFEGLKAFR GVDNKIRLFQ PNLNMDRMYR 120 

SAVRATLPVF DKEELLECIQ QLVKLDQEWV PYSTSASLYI RPAFIGTEPS LGVKKPTKAL 180 

LFVLLSPVGP YFSSGTFNPV SLWANPKYVR AWKGGTGDCK MGGNYGSSLF AQCEDVDNGC 240 

QQVLWLYGRD HQITEVGTMN LFLYWINEDG EEELATPPLD GIILPGVTRR CILDLAHQWG 300 

EFKVSERYLT MDDLTTALEG NRVREMFSSG TACWCPVSD ILYKGETIHI PTMENGPKLA 360 
SRILSKLTDI QYGREESDWT IVLS 



Seq ID NO: 27 Nucleotide sequence; 
Nucleic Acid Accession #: XM_039209 

Coding sequence: 656. .2758 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

TCGCGCGGGG GCCGCCCCCT CCCCTTCCCT CCACCCTGGG CGGGGGCGCG CGAGAAGCGG 60 

TGACGTCAAG GGGCGCGCTG TGGCAGCACC TCCCCGCGCG CTAGTTAAAA AGAAGAAGAA 120 

AAGAGGGAAC GAAACATGAG AGGCTGTGTG AGAAGCTGCA GCCGCCGGCA GAGGAGACCT 180 

CAGCATCATC TAGAGCCCAG CGCTGGCCCT GCCTCCGCCT GCCCCGCCGC CGCCGTCGCC 240 

GTTTCTGTTC CTGCTACTGT CCCACCTAAA CAACTCCCGT TACACGGACA AGTGAACATC 300 

TGTGGCTGTC CTCTCCTTTT CTTCCTCCTC TTCCAACTCC TTCTCCTCCT CCCACTTCCC 360 

AGCCGCAGCA GAAAGCCCCC AACCCAACTG ACACTGGCAC AACTGCAAAC GGTGTCATCC 420 

GCACAACTTT ATCTCGCTCC TCGGGCTCCC GTAAGGCATT GGACCCATCG CCGCGTCTTT 480 

TATTTTTTGC AAAGTTGCAT CGCTGTACAT ATTTTTGTCC CCGCCACCTC CCTCTGTCTG 540 

TGGAGTGCCC TACAGCCCCG CAAACTCCTC CTGGAGCTGC GCCCTAGTGC CCCTGCTGGG 600 

CAGTGGCGTT CCCCCCCATC CTCCCGCGCC CAGCCCCTGC TGCTCTGGGC AGACGATGCT 660 

GAAGATGCTC TCCTTTAAGC TGCTGCTGCT GGCCGTGGCT CTGGGCTTCT TTGAAGGAGA 720 

TGCTAAGTTT GGGGAAAGAA ACGAAGGGAG CGGAGCAAGG AGGAGAAGGT GCCTGAATGG 780 

GAACCCCCCG AAGCGCCTGA AAAGGAGAGA CAGGAGGATG ATGTCCCAGC TGGAGCTGCT 840 

GAGTGGGGGA GAGATGCTGT GCGGTGGCTT CTACCCTCGG CTGTCCTGCT GCCTGCGGAG 900 

TGACAGCCCG GGGCTAGGGC GCCTGGAGAA TAAGATATTT TCTGTTACCA ACAACACAGA 960 

ATGTGGGAAG TTACTGGAGG AAATCAAATG TGCACTTTGC TCTCCACATT CTCAAAGCCT 1020 

GTTCCACTCA CCTGAGAGAG AAGTCTTGGA AAGAGACCTA GTACTTCCTC TGCTCTGCAA 1080 

AGACTATTGC AAAGAATTCT TTTACACTTG CCGAGGCCAT ATTCCAGGTT TCCTTCAAAC 1140 

AACTGCGGAT GAGTTTTGCT TTTACTATGC AAGAAAAGAT GGTGGGTTGT GCTTTCCAGA 1200 

TTTTCCAAGA AAACAAGTCA GAGGACCAGC ATCTAACTAC TTGGACCAGA TGGAAGAATA 1260 

TGACAAAGTG GAAGAGATCA GCAGAAAGCA CAAACACAAC TGCTTCTGTA TTCAGGAGGT 1320 

TGTGAGTGGG CTGCGGCAGC CCGTTGGTGC CCTGCATAGT GGGGATGGCT CGCAACGTCT 1380 

CTTCATTCTG GAAAAAGAAG GTTATGTGAA GATACTTACC CCTGAAGGAG AAATTTTCAA 1440 

GGAGCCTTAT TTGGACATTC ACAAACTTGT TCAAAGTGGA ATAAAGGGAG GAGATGAAAG 1500 

AGGACTGCTA AGCCTCGCAT TCCATCCCAA TTACAAGAAA AATGGAAAGT TGTATGTGTC 1560 

CTATACCACC AACCAAGAAC GGTGGGCTAT CGGGCCTCAT GACCACATTC TTAGGGTTGT 1620 

GGAATACACA GTATCCAGAA AAAATCCACA CCAAGTTGAT TTGAGAACAG CCAGAGTCTT 1680 

TCTTGAAGTT GCAGAACTCC ACAGAAAGCA TCTGGGAGGA CAACTGCTCT TTGGCCCTGA 1740 

CGGCTTTTTG TACATCATTC TTGGTGATGG GATGATTACA CTGGATGATA TGGAAGAAAT 1800 

GGATGGGTTA AGTGATTTCA CAGGCTCAGT GCTACGGCTG GATGTGGACA CAGACATGTG 1860 

CAACGTGCCT TATTCCATAC CAAGGAGCAA CCCACACTTC AACAGCACCA ACCAGCCCCC 1920 

CGAAGTGTTT GCTCATGGGC TCCACGATCC AGGCAGATGT GCTGTGGATA GACATCCCAC 1980 

TGATATAAAC ATCAATTTAA CGATACTGTG TTCAGACTCC AATGGAAAAA ACAGATCATC 2040 
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AGCCAGAATT CTACAGATAA TAAAGGGGAA AGATTATGAA AGTGAGCCAT CACTTTTAGA 2100 

ATTCAAGCCA TTCAGTAATG GTCCTTTGGT TGGTGGATTT GTATACCGGG GCTGCCAGTC 2160 

AGAAAGATTG TATGGAAGCT ACGTGTTTGG AGATCGTAAT GGGAATTTCC TAACTCTCCA 2220 

GCAAAGTCCT GTGACAAAGC AGTGGCAAGA AAAACCACTC TGTCTCGGCA CTAGTGGGTC 2280 

CTGTAGAGGC TACTTTTCCG GTCACATCTT GGGATTTGGA GAAGATGAAC TAGGTGAAGT 2340 

TTACATTTTA TCAAGCAGTA AAAGTATGAC CCAGACTCAC AATGGAAAAC TCTACAAAAT 2400 

TGTAGATCCC AAAAGACCTT TAATGCCTGA GGAATGCAGA GCCACGGTAC AACCTGCACA 2460 

GACACTGACT TCAGAGTGCT CCAGGCTCTG TCGAAACGGC TACTGCACCC CCACGGGAAA 2520 

GTGCTGCTGC AGTCCAGGCT GGGAGGGGGA CTTCTGCAGA ACTGCAAAAT GTGAGCCAGC 2580 

ATGTCGTCAT GGAGGTGTCT GTGTTAGACC GAACAAGTGC CTCTGTAAAA AAGGATATCT 2640 

TGGTCCTCAA TGTGAACAAG TGGACAGAAA CATCCGCAGA GTGACCAGGG CAGGTATTCT 2700 

TGATCAGATC ATTGACATGA CATCTTACTT GCTGGATCTA ACAAGTTACA TTGT ATAG TT 2760 

TCTGGGACTG TTTGAATATT CTATTCCAAT GGGCATTTAT TTTTTATCCT GTCATTAAAA 2820 

AAAAAAGACT GTTATCCTGC TACACACTCC TGTGATTTCA TTCTCTTTTA TTAATTTAAA 2880 

AATAATTTCC AGAAATGTGC AGATCCTCTG TGTGTATGTC AGCATGTTTG TTCACATATG 2940 

CACATACACA TACTCATAAC CCCTATATGC GTTGTTGCAT AACAGATGAT TTTTTAAAAT 3000 

ATATACTTCC TTATGCAAAG TAATTTACAC AGAAATTCCA TTGTAAATTG ATAATGGATT 3060 

TTTTATGTTA CTAGAAGAGA TTATTTGACT TCCCAGGAAT TTTCTGTCTG TAATCACTAA 3120 

AGTCAACTTT AATAGAGTTT TGAAACAGTA CTGTGCAATC CGATGGATCT AATTAAAAAA 3180 

AAGGCAATAT TTTTATATTA AAGTACTATA CTAGGAGAGA ATGTTTCAGA ACTCCCTGAT 3240 

GAATTTCTAA GTGAGCAACT TGATATAAAA TTGTAATCTT CATTTTTGTC AGTGTATCCA 3300 

GTTACAGAAT GCTACACACT TACCTTTTTA TTGGCTGAGA AATCTGGTTA TTTCATCTTA 3360 

ATCTCAAGAT TGTTTTCAAG TGTTTTATAA TTAAATCATA ATAGCATATT TTAAAATCAA 3420 

TCTTCCTAAA AGGTCTGCTT TTATTGTATA TTTTATTTAA CAATAGGCAC TGGGTTTGTG 3480 
TTACATATTT ATATATTTTA TTTTATTTTT ATAATATAGA CATCACCTAG 

Seq ID NO: 28 Protein sequence ; 
Protein Accession #: XP_039209 



1 11 21 31 41 51 

I I I 1 I I 

MLKMLSFKLL LLAVALGFFB GDAKFGERNE GSGARRRRCL NGNPPKRLKR RDRRMMSQLE 60 

LLSGGEMLCG GFYPRLSCCL RSDSPGLGRL ENKIFSVTNN TECGKLLEEI KCALCSPHSQ 120 

SLFHSPEREV LERDLVLPLL CKDYCKEFFY TCRGHIPGFL QTTADEFCFY YARKDGGLCF 180 

PDFPRKQVRG PASNYLDOME EYDKVEEISR KHKHNCFCIQ EWSGLRQPV GALHSGDGSQ 240 

RLFI LEKEGY VKILTPEGEI FKEPYLDIHK LVQSGIKGGD ERGLLSLAFH PNYKKNGKLY 300 

VSYTTNQERW AIGPHDHILR WEYTVSRKN PHQVDLRTAR VFLEVAELHR KHLGGQLLFG 360 

PDGFLYIIIiG DGMITLDDME EMDGLSDFTG SVLRLDVDTD MCNVPYSIPR SNPHFNSTNQ 420 

PPEVFAHGLH DPGRCAVDRH PTDININLTI LCSDSNGKNR SSARILQIIK GKDYESEPSL 480 

LEFKPFSNGP LVGGFVYRGC QSERLYGSYV FGDRNGNFLT LQQSPVTKQW QEKPLCLGTS 540 

GSCRGYFSGH ILGFGEDELG EVYILSSSKS MTQTHNGKLY KIVDPKRPLM PEECRATVQP 600 

AQTLTSECSR LCRNGYCTPT GKCCCSPGWE GDFCRTAKCE PACRHGGVCV RPNKCLCKKG 660 
YLGPQCEQVD RNIRRVTRAG ILDQIIDMTS YLLDLTSYIV 



Seq ID NO: 29 Nucleotide sequence: 
Nucleic Acid Accession #: NMJJ24756 

Coding sequence: 75.. 2924 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

III- I I I 

AAGACAACGT .CACTAGCAGT TTCTGGAGCT ACTTGCCAAG GCTGAGTGTG AGCTGAGCCT 60 

GCCCCACCAC CAAGATGATC CTGAGCTTGC TGTTCAGCCT TGGGGGCCCC CTGGGCTGGG 120 

GGCTGCTGGG GGCATGGGCC CAGGCTTCCA GTACTAGCCT CTCTGATCTG CAGAGCTCCA 180 

GGACACCTGG GGTCTGGAAG GCAGAGGCTG AGGACACCAG CAAGGACCCC GTTGGACGTA 240 

ACTGGTGCCC CTACCCAATG TCCAAGCTGG TCACCTTACT AGCTCTTTGC AAAACAGAGA 300 

AATTCCTCAT CCACTCGCAG CAGCCGTGTC CGCAGGGAGC TCCAGACTGC CAGAAAGTCA 360 

AAGTCATGTA CCGCATGGCC CACAAGCCAG TGTACCAGGT CAAGCAGAAG GTGCTGACCT 420 

CTTTGGCCTG GAGGTGCTGC CCTGGCTACA CGGGCCCCAA CTGCGAGCAC CACGATTCCA 4 80 

TGGCAATCCC TGAGCCTGCA GATCCTGGTG ACAGCCACCA GGAACCTCAG GATGGACCAG 540 

TCAGCTTCAA ACCTGGCCAC CTTGCTGCAG TGATCAATGA GGTTGAGGTG CAACAGGAAC 600 

AGCAGGAACA TCTGCTGGGA GATCTCCAGA ATGATGTGCA CCGGGTGGCA GACAGCCTGC 660 

CAGGCCTGTG GAAAGCCCTG CCTGGTAACC TCACAGCTGC AGTGATGGAA GCAAATCAAA 720 

CAGGGCACGA GTTCCCTGAT AGATCCTTGG AGCAGGTGCT GCTACCCCAC GTGGACACCT 780 

TCCTACAAGT GCATTTCAGC CCCATCTGGA GGAGCTTTAA CCAAAGCCTG CACAGCCTTA 840 

CCCAGGCCAT AAGAAACCTG TCTCTTGACG TGGAGGCCAA CCGCCAGGCC ATCTCCAGAG 900 

TCCAGGACAG TGCCGTGGCC AGGGCTGACT TCCAGGAGCT TGGTGCCAAA TTTGAGGCCA 960 

AGGTCCAGGA GAACACTCAG AGAGTGGGTC AGCTGCGACA GGACGTGGAG GACCGCCTGC 1020 

ACGCCCAGCA CTTTACCCTG CACCGCTCGA TCTCAGAGCT CCAAGCCGAT GTGGACACCA 1080 

AATTGAAGAG GCTGCACAAG GCTCAGGAGG CCCCAGGGAC CAATGGCAGT CTGGTGTTGG 1140 

CAACGCCTGG GGCTGGGGCA AGGCCTGAGC CGGACAGCCT GCAGGCCAGG CTGGGCCAGC 1200 

TGCAGAGGAA CCTCTCAGAG CTGCACATGA CCACGGCCCG CAGGGAGGAG GAGTTGCAGT 1260 

ACACCCTGGA GGACATGAGG GCCACCCTGA CCCGGCACGT GGATGAGATC AAGGAACTGT 1320 

ACTCCGAATC GGACGAGACT TTCGATCAGA TTAGCAAGGT GGAGCGGCAG GTGGAGGAGC 1380 

TGCAGGTGAA CCACACGGCG CTCCGTGAGC TGCGCGTGAT CCTGATGGAG AAGTCTCTGA 1440 



192 



WO 02/079492 



PCT7US02/04915 



TCATGGAGGA 
AGCACCTGCA 
AGCTCTATTT 
AGGAGACCCA 
TGCAGAACGC 
GGGCGCGGGC 
GCGCGCTGAA 
TCGCCGCCCT 
AGGAGGTGCT 
AGATCCGCGT 
ACGAGCTGGC 
AGCACCTGGA 
GGCTGGCGCG 
AGGCCGAGGC 
CACTCTTCGC 
GGAACTTCCA 
TGCTGAGCAG 
AGAAGGAAGC 
GCGCGGCGCT 
CGGCTGCCCT 
TCCCTGAACA 
TTGAATTTGG 
CAGTCTGTAC 
TGCAGAAGGG 
TGTCGGGCAC 
GATCAGACAT 
TGGAGGTCAT 
TACGTGGCTT 
AATGCTTCTC 
TCAGACAGCA 
GTGTTTTGCT 
TCCTCTTCTC 
AAAATGGAGG 
AACCTGCTTG 
GAAATTAGTA 
CTAAAATATC 
GAGGTAATTG 
TCTGATGATT 
TGAAGATGCC 
CAGCCATGTG 



GAACAAGGAG 
GGGTGGCCAT 
AGACCTGGAC 
GGTGAGCCTG 
CGTGGACGCC 
GGCCACGTCG 
GGCGGCCGCG 
GCTGGAGGAC 
GGAGGAGATG 
GGCCCTGCAG 
CGCCCGAGTG 
GCCCAGCCAC 
GGAGCTCCAG 
CGGGGCCGGG 
CACTCAGCGC 
AGGGCTCATG 
GAAAGGGAAG 
GGAGCCTTTG 
CTGGGAGGCA 
GCAGACAGTG 
TGGCTACTTC 
CCCAGGGCCA 
CACTG^GCAG 
TGAGCGAGTA 
TGCATTTGGG 
CATGGACTCG 
TCAGTTGGTC 
CCCTGTAACC 
CGGGCAGGAC 
CGGCCTGGGC 
CCTCCTGTGG 
TTCTCCCTTA 
CAACATTTTG 
GTCTCTATTA 
TCTTTAGTTT 
ATCTTGAATT 
GATCTTGGGG 
TTATAAGTTT 
TTGGTTCCTC 
GAACAGTGAG 



GAGGTGGAGC 
GCCGACCTCA 
GTCATCCGGG 
GACGAGCGGC 
GTGTCGCTGG 
CGGCTCCGGA 
GCCGAGGCCC 
GCGCTGCGGC 
TCTGAGCAGA 
GACGCCGCTA 
ACGGCCCTGG 
GACGCGGGCC 
AGCCTGAGCA 
GCCGCCTCCC 
AGCTTGGAGC 
GAAGCCAACG 
AAGCAGCAGA 
GTGGACATAC 
GGATCCCCTG 
AAGTTCAACA 
CGAGCCCCTG 
GGCACCGGGC 
GGGAGTGGAA 
TGGTTTGAGT 
GGCTTCCTGA 
CCCAGCTCTC 
TGTCTCTTCC 
ACATGGGGCT 
ATGGTCCTAG 
TCCAACTCTT 
TTGGAAACTT 
TCGTATGATA 
GCCAACATTG 
GTCAGTAATG 
CAGTCATTCC 
GTAATCCCTA 
GCGGTTCCCC 
GATAGTTCCT 
TTCACTGTCT 
TCAATTAAAC 



GGCAGCTCCT 
TCAAGTACGT 
AGGGCCAGAG 
GGCAGCTGGA 
CCGTGGACGC 
GCCAAGTGCA 
GCCACGAGGT 
ACGAGGCGGT 
CGCCGGGACC 
GCGGGCTGCA 
AGCAGGCCTC 
GCGAGGAGGC 
ACGACGTCAA 
TCAACGCCTC 
AGCACGAGCG 
TCAGCCTGGA 
AAGACCTGGA 
GGGTCACAGG 
TGGCCTTCTA 
CCACATACAT 
AGCGTGGTGT 
AGCTGGTGTT 
GCACAGCAAC 
TAACCCAGGG 
TGTTTAAGAC 
CTCGGCCTGG 
CTGGAAACCT 
TGGCCATTTC 
GAAGCCTGAA 
CACCACACCC 
CTGTACAACA 
GAAAGACATT 
GAAAGCACTA 
AAGACGACAG 
TTGTAGGATA 
TAATCCCCAC 
CATGCTGTTC 
CCTGTGTTCA 
GCCATGATTG 



GGAGCTCAAC 
GAAGGACTGC 
GGACGCCACG 
CGGCTCCTCC 
GCACAAAGCG 
GGCGCTGGAT 
GCGCCAGCTG 
GCTGGCCGCG 
GCTGCCCCTG 
GGAGCAGGCG 
GGAGCCCCCG 
CGCCACCACC 
GAATGTCGGG 
CCTTGACGGC 
GCTCTTCCAC 
CCTGGGGAAG 
AGCTCCCCGG 
GCCTGTGCCA 
TGCCAGCTTT 
CAACATTGGC 
CTACCTGTTT 
TGGAGGTCAC 
GGTCTTTGCC 
ATCAATAACA 
CTGAACCCCA 
GGCTCTGGCC 
TCTGCAAAGA 
TCCATGATGA 
CCTTGGCTTG 
TGTATTCTAC 
CTTTAAACTT 
CTTCCCCAGG 
GAGGGCAATG 
CCTGGCCAAC 
TGGTTTAGCT 
ATCAAGGGAG 
TTGTGATAGT 
TTCTCCTTCC 
TAAGTTTCCT 
TATAAATT 



CTCACGCTGC 
AATTGCCAGA 
CGTGCCCTGG 
CTGCAGGCCC 
GAGGGCGAGC 
GACGAGGTGG 
CACAGCGCCT 
CTCTTCGGGG 
AGCTACGAGC 
CTCGGCTGGG 
CGGCCGGCAG 
GCCCTGGCCG 
CGGTGCTGCG 
CTCCACAACG 
AGCCTCTTTG 
CTGCAGACCA 
AAGAGGGACA 
GGTGCCTTGG 
TCAGAAGGGA 
AGCAGCTACT 
GCAGTGAGCG 
CATCGGACTC 
ATGGCTGAGC 
AAGAGAAGCC 
GCCCCAATCT 
AAGGATGGGC 
TGGTGTGGTG 
GAAGGACTGG 
GCATGCCTTC 
AACTTCTTTG 
TTCTCTTGCT 
AGGAATGTTT 
GGATTAAACC 
CAAGGGAAAG 
GTGCCCCCAC 
AGATCAGGTG 
TCTCACGAGA 
TGCCACCTTG 
GAGGCCTCCC 



1500 
1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



Seq ID NO: 30 Protein sequence: 
Protein Accession #: NP 079032 



MILSUiFSLG 
PMSKLVTLLA 
CCPGYTGPNC 
LGDLQNDVHR 
FSPIWRSFNQ 
TQRVGQLRQD 
GARPEPDSLQ 
ETFDQISKVE 
GHADLIKYVK 
DAVSLAVDAH 
EDALRHEAVL. 
RVTALEQASE 
AGAASLNA5L 
GKKQQKDLEA 
TVKFNTTYIN 
GQGSGSTATV 



11 
I 

GPLGWGLLGA 
LCKTEKFLIH 
EHHDSMAIPE 
VADSLPGLWK 
SLHSLTQAIR 
VEDRLHAQHF 
ARLGQLQRNL 
RQVEELQVNH 
DCNCQKLYXiD 
KAEGERARAA 
AALFGEEVLE 
PPRPAEHLEP 
DGLHNALFAT 
PRKRDKKEAE 
IGSSYFPEHG 
FAMAELQKGE 



21 
I 

WAQASSTSLS 
SQQPCPQGAP 
PADPGDSHQE 
ALPGNLTAAV 
NLSLDVEANR 
TLHRSISELQ 
SELHMTTARR 
TALRELRVIL 
LDVIREGORD 
TSRLRSQVQA 
EMSEQTPGPL 
SHDAGREEAA 
QRSLEQHQRL 
PLVDIRVTGP 
YFRAPERGVY 
RVWFELTQGS 



31 
I 

DLQSSRTPGV 
DCQKVKVMYR 
PQDGPVSFKP 
MEANQTGHEF 
QAISRVQDSA 
ADVDTKLKRL 
EEELQYTLED 
MEKSLIMEEN 
ATRALEETQV 
LDDEVGALKA 
PLSYEQIRVA 
TTALAGLARE 
FHSLFGNFQG 
VPGALGAALW 
LFAVSVEFGP 
ITKRSLSGTA 



41 
I 

WKAEAEDTSK 
MAHKPVYQVK 
GHLAAVINEV 
PDRSLEQVLL 
VARADFQELG 
HKAQEAFGTN 
MRATLTRHVD 
KEEVERQLLE 
SLDERRQLDG 
AAAEARHEVR 
IiQDAASGLQE 
LQSLSNDVKN 
LMEANVSLDL 
EAGSPVAFYA 
GPGTGQLVFG 
FGGFLMFKT 



51 

I. 

DPVGRNWCPY 
QKVLTSLAWR 
EVQQEQQEHL 
PHVDTFLQVH 
AKFEAKVQEN 
GSLVLATPGA 
EIKELYSESD 
LNLTLQHLQG 
SSLQALQNAV 
QLHSAFAALL 
QALGWDELAA 
VGRCCEAEAG 
GKLQTMLSRK 
SFSEGTAALQ 
GHHRTPVCTT 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
B40 
900 



Seq ID NO: 31 Nucleotide sequencer 
Nucleic Acid Accession #: AB037715 

Coding sequence: 370.. 3489 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAACGCTCAC AGAACAGGCA GTGCAATTCC ATGTTCCTCT TAAGTATGTT AGCCCTACCG 60 

GGAGCTGAGC TGGCCAGTCT ACTTGGAGAG GAAAAGTAGA TCTGGGGAAG GTGGAAGGGT 120 

CAGTTCCTAA GTGACTTCCT CCTCGGGGAT GGTAAGGGCA TTTGCTGATC TCCAGTGACT 180 

GCCTGGTGCC TCATGGTCAG ACTCGGCTGT CTCACTCCCA GATATCTGAT TTTGCAAAAA 240 

GGGACACACC TATCTGCAGC AAAGAAGACA CTGACCAGAT TGCGAGCGGT GCTTTTGGAT 300 
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GCTCTGTAGC CACCCGGGGC CCAGGAGGAC 
TCGGAGACCA. TGGCAGTGCA GCTGGTGCCC 
GAGGGCCGCC GATGTCAAGT ACATCTTCTT 
CCCAAGCTGT TGGCCAAGGA GCTTCTTGAC 
5 AAGGAGTACT TTGGAATAGC ATTCACAGAT 
GATCGAAGAG TATTGGAACA TGACTTCCCT 
TGTGTCAGGT TCTATATAGA AAGCATTTCA 
TTCTTTCTGA ACGCGAAGTC CTGCATCTAC 
GTGTTTGAAT TAGCTTCCTA TATTTTACAG 

10 GTTGTGAGGA GTGACTTGAA GAAGCTGCCA 
CCTTCCCTGG CCTACTGTGA AGACAGAGTC 
ACAAGAGGTC AAGCAATCGT AAACTACATG 
GTTCACTATT ATGCAGTGAA GGACAAGCAG 
AAAGGGATCT TCCAGTATGA CTACCATGAT 

15 AGACAGTTGG AAAACCTGTA CTTCAGAGAA 
CGCAGGGCTT CAGTGACAAG GAGGACGTTT 
TATGCATGTC CGGCATTGAT CAAGTCCATC 
TATCTGGACA GAAAGCAGAG TAAGTCCAAA 
GCCATCGACC TGACCGAGAC GGGGACGCTG 

20 AAGGGGAAGA TCATCAGCGG CAGCAGCGGC 
GATAGCTCGC AGTCGGCCAA GAAGGACATG 
CTGGAGGAAA CCCTGCGTCA GAGGCTGGAG 
GAGCTCACGG GCAAGCTGCC AGTAGAATAT 
GTTCGGAGAA GAATAGGAAC AGCCTTCAAA 

25 GAGGAAGCTG AGCTGGAACG CCTGGAACGA 
GCCGCCCGCC GCCTAGCCAG TGACCCCAAC 
ACCTCGTATC TGAATGCACT GAAGAAACTG 
CGCATCAAGT CTGGGAAGAA ACCCACCCAG 
ATTGCCAGTG AAGACAGCTC CCTCTCAGAT 

30 GTTACCAGCA CAATATCCCC CCTACATTCT 
TCGCACAACA GGCCTCCTCC TCCCCAGTCC 
CGCAACGACT ATGACAAGTC ACCCATCAAG 
GAACCCTATG AGAAGGTCAA GAAGCGCTCC 
TTCCCCAGCA CAGGAAGCTG TGCGGAAGCC 

35 CCCATCCGCG GCCTCCCGCA CTGGAACTCC 
CGGGTCCGGA GTCCCCACTA CGTCCATTCC 
CTGCACAGCC TCGCACTGCA CTTTAGGCAC 
CTCCTGGGCT CGGAAAACGA CACCGGGAGC 
AGCAACGGCT CAGACCCCAT GGACGACTGC 

40 CACTACTACC CGGCGCAGAT GAACGCCAAC 
AAGGCGCGCC AGAGGCAGAG GCAGCGGCAG 
TCGGGCAGCA TGCCCAACCT GGCGGCGCGC 
GGCGGTGTGT ACCTGCACAG CCAGAGCCAG 
CCGCTGTACA TCGAGGGCGG ' CGCCACGCCC 

45 GAGTGCCACT ACAGCGTCAA GGCTCAGTTC 
CTGTTCAAGG AGAGCTGGCG CGGCGGCGGC 
CCGTCGCGAT CGCAGATCCT GCGGACTCCG 
GGCGCGGGCC GTGCCGCCGT CTCAGACGAG 
TCGCACAAGG AGCACAGCCG CCTGTCGCAC 

50 CAGTACAGCA CCTCCTCCCA GAGCACCTTC 
CAGATGTGCA AGGCCACGTC AGCTGCCTTA 
AGTGAAATTG GAGCCACCCC CCCAAGCAGC 
GAAGCAACAG AAAACTCACC CATTCTGGAT 
GATGAATAGA GGAGCTACAA TGATAGCTGT 

55 GCTGATGTCC AGTGGTACGG GCAGGAAAAA 
• 6CGGCCTAAT CTGACCGCCT CAACGCCATT 

TTACCCAGAC GCACCGTCAC CCTGCACCAG 
CTCCGCATTC CCTCCCCCTT GAAAACCTGA 
CACTGTGTGT CCCCTGGCGC TCTTGCCCAT 

60 • CTTGGTGGCT TCCCTCTGCC ATGACAGCCC 
GGCATCCAAT TCCTGCGGAT AAGTAGCGTT 
CAGGGTGACC CAGAAAGACG ATTCAGCTGT 
CAAGCACTTC ATGAAGAGGA GGCCTCGTGG 
GATGGGACAG CTTGTGGGGA TGGCTATGGG 

65 ACACCAGAAA TGCATCGGAG GACCACAATC 
TAAAAACATA AAAAATTAAG AGGGGCCAAG 
TTTTAAATTC TGAACTGCTA CTACACACAA 
CTCTCTCTAG CCCTCTCCCT TACTGGCCCA 
GCCCCAATGC CACGGTAAAG GCGAGGAAGT 

70 ATCCATCTGG ACACAAAGAG AGACCTGTGG 
CATGCAGGGG GTTCAGCCGA GCCCAAGACT 
AACGTAAGGT GATAATGGCC AAAAGTGGTT 
ATCCTATTTT TTTGCATAAG GTGTTTCATT 
ACATTGCGAT CCATTCAGTG TTTAACTGTC 

75 GTGACAAAAG AGCTCAGATC CGACTTCTCC 
TGCCCTTAGG TAGAAAGATT TGACTCGTGT 



TGACTCGGCA GCAGGATTCG TGCATGGGAA 360 

GACTCAGCTC TCGGCCTGCT GATGATGACG 420 

GATGACAGGA AGCTGGAACT CCTAGTACAG 480 

CTTGTGGCTT CTCACTTCAA TCTGAAGGAA 540 

GAAACGGGAC ACTTAAACTG GCTTCAGCTA 600 

AAAAAGTCAG GACCCGTGGT TTTATACTTT 660 

TACCTGAAGG ATAATGCTAC CATTGAGCTT 720 

AAGGAGCTTA TTGACGTTGA CAGCGAAGTG 780 

GAGGCAAAGG GAGATTTTTC TAGCAATGAA 840 

GCCCTTCCCA CCCAAGCCCT GAAGGAGCAC 900 

ATTGAGCACT ACAAGAAACT GAACGGTCAG 960 

AGCATCGTGG AGTCTCTCCC AACCTACGGG 1020 

GG CAT AC CAT GGTGGCTGGG CCTGAGCTAC 1080 

AAAGTGAAGC CAAGAAAGAT ATTCCAATGG 1140 

AAGAAGTTTT CCGTGGAAGT TCATGACCCA 1200 

GGGCACAGCG GCATTGCAGT GCACACGTGG 1260 

TGGGCTATGG CCATAAGCCA ACACCAGTTC 1320 

ATCCATGCAG CACGCAGCCT GAGTGAGATC 1380 

AAGACCTCGA AGCTGGCCAA CATGGGTAGC 1440 

AGCCTGCTGT CTTCAGGTTC TCAGGAATCA 1500 

CTGGCTGCCT TGAAGTCCAG GCAGGAAGCT 1560 

GAACTGAAGA AGCTGTGTCT CCGAGAAGCT 1620 

CCCCTGGATC CAGGGGAGGA ACCACCCATT 1680 

CTGGATGAAC AGAAAATCCT GCCCAAAGGA 1740 

GAGTTTGCCA TTCAGTCCCA GATTACGGAG 1800 

GTCAGCAAAA AACTGAAGAA ACAAAGGAAA 1860 

CAGGAGATTG AAAATGCAAT CAATGAGAAC 1920 

AGGGCTTCGC TGATCATAGA CGATGGAAAC 1980 

GCCCTTGTTC TTGAGGATGA AGACTCTCAG 2040 

CCTCACAAGG GACTCCCTCC TCGGCCACCG 2100 

CTGGAGGGAC TCCGACAGAT GCACTATCAC 2160 

CCCAAAATGT GGAGTGAGTC CTCTTTAGAT 2220 

TCTCACAGCC ATTCCAGCAG CCACAAGCGC 2280 

GGCGGAGGAA GCAACTCCTT GCAGAACAGC 2340 

CAGTCCAGCA TGCCGTCCAC GCCAGACCTG 2400 

ACGAGGTCGG TGGACATCAG CCCCACCCGA 2460 

CGGAGCTCCA GCCTGGAGTC CCAGGGCAAG 2520 

CCCGACTTCT ACACCCCGCG GACTCGTAGC 2580 

TCGTCGTGCA CCAGCCACTC GAGCTCGGAG 2640 

TACTCCACGC TGGCCGAGGA CTCGCCGTCC 2700 

CGGGCGGCGG GCGCACTGGG CTCAGCCAGC 2760 

GGGGGTGCGG GGGGCGCGGG GGGCGCGGGG 2820 

CCCAGCTCGC AGTACCGCAT CAAGGAGTAC 2880 

GTGGTGGTGC GCAGCCTGGA GAGCGACCAG 2940 

AAGACGTCCA ACTCCTACAC GGCGGGCGGC 3000 

GGCGACGAGG GCGACACGGG CCGCCTGACG 3060 

TCGCTGGGCC GCGAGGGCGC CCACGACAAG 3120 

CTGCGCCAGT GGTACCAGCG TTCCACCGCC 3180 

ACCAGCTCCA CCTCCTCGGA CAGCGGCTCG 3240 

GTGGCGCACA GCAGGGTCAC CAGGATGCCC 3300 

CCTCAAAGCC AGAGAAGCTC GACACCGTCA 3360 

CCCCACCACA TCCTAACCTG GCAGACTGGA 3420 

GGGTCTGAGT CTCCACCTCA CCAAAGTACT 3480 

TTCCTGGATT CCTCCCTCTA TCCAGAACTA 3540 

GCCAAGCCCG GGACCCTCGT GTGAGCCAGC 3600 

CTGAGATCAC CTCACTGCCT CTCATTTGCC 3660 

CTTTGGCCCT CAGCACTTTT TTTCTCCTGT 3720 

CTGAGGAGAC ATTCTGGAAG GTTCCGGTCC 3780 

AGAGAGCCAG ACACCAATCC TCAATGGCAC 3840 

CTAGGCCAGG AACCATCAGG GGGGCCAGCC 3900 

GGGAGAGAAC GGGAAAGGGG ACTTGGGTTA 3960 

GTCCAGCCTG CCACCCATAC GTAGGCCAAC 4020 

CATATTCAGT TTACACCTGA AATATTCCTT 4080 

GGAAGGGGAG GTTGAGAAAG GAAGTTCTCG 4140 

AGTTCTATGC TGCCAAAGAT TAAAAATAAA 4200 

AGGAAGACAT TCTTTCTGCA AGGAAATTTC 4260 

GTGAAAGTCA ACCCTATGTA AACTGGTGTC 4320 

CTTCTCTCTC CGTAGAGAGC CTGAAAAACT 4380 

CTTGGCTGGC GTTGCTGACT CACAGTCGCC 4440 

GAGTCATAGA GGGTACTGTT AGCCCOGGTC 4500 

CAAAGCTGCT TTCCTTTCAG GATTTGTAGT 4560 

CTCTCTCATT AAACCAACCA GTAAAAGCGT 4620 

TTCGTTTTTA TGGGAAACCA AGGGAAAAGC 4680 

GTGGCTCATT TTCTGTTCGT TAGCACTTGT 4740 

TATGTGTCAC TTATTCCAAG AACCCAACTA 4800 

GTCTACTAGC CAACAGGCAG AGCAGGGTTG 4860 
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AAAAAAATAT CAGCTCCCAA AGGGCCCATG 
ACATTTGTGT GCAGATACCA AAAGAGGAGG 
GCACGTTTAC ATGTTTTGAG CTATGCTTCA 
GGCCTCAAAA ATACTTTTAT AGTAACAAGT 
CACAAAAAGG TTTCCGCAGA GGTGGTATGC 
GGGGTGGGGG TGGAATTTTT TTCTCACTCT 
CCAGGGACAG GAGCCAGGGT GGGGGTAGTT 
AGCATAAAAA CAAAGAAAAA TCTTCGCTTT 
GGCTCTACCA GACCAGGAGG GTAAGGATGG 
CCTTCTGCTG CAGCCTGGAG ACCACCGAGA 
GGACCCGGCA GGGACAAGGC GGGCCGTGGC 
GGGCCTGCTT TCCCCAGCTC CATGCATGGC 
TTCATATTCC CAGAACGCTT TAAGTGTACA 
ATTAAATGAT TCTAGGGATT CACTGGGGGA 
AGCTACAAAG AACAGTGATT TTTTTTTTTT 
TGGGCCATTT TTCTTTCTCC CAAAGAAGAT 
CAGGAAAAGT CAAAAGGGAA AAGGCAGCTG 
ATGCAGAGTA GCTTGAAATC TAGTCTGGAG 
GTTGCAAGGA ATGAGAGGCA AAAATTCTAA 
CAGAGAGAAA TGGAGAGCAG GAATTACAGT 
AAGACAGAGA TTAAGTAAAA CAGGTTTTAC 
TACATAAAAC GTTAGTCCTT TGAGACTGAC 
TGTAGTTATT GTACACAAGC ACTTGCAAAC 
AGGTGAAATA CGAAGTCCTT GGTCTGATAT 
AAAGAAATTG CCTGTTTCAG CCAGAAGACT 
GAGCCAGGTT GATTTTTTAT TTTATTATAT 
AATTTGTTTT CATTCAGTAT TAGTTTAGTT 
TCAGATGACC AGTTACTGCT TAGTTAACTA 
AATAGTTTAT TACAAGTTGT GTAAAATGGA 
AGGTTGCTCC TGAAACTGAC TGTAGAGCAT 
TGTAATCAAT GAAAAAGATG TACGTTGTAG 
GCTTTTAATT TATTCTTTTT GTATTAAGAA 
AGTGTTGTCA ACACTTATTA AAGCATTTTC 



TGTCTACATC ATCAGTTACT GTCATGCACC 4920 

AAAGAAGAAA AAAATTAATG TGTGGGAGCT 4980 

AACACAACTG GAAAGCCATC AATCTTCAAA 5040 

GCACGACTTT AGTTGGGTTA TTCAAGATGG 5100 

TGTGCTTTTG GCGCAAGTGG TGGGGGGATG 5160 

AATGACTTCC TATTGGAAAG GCATTGACAG 5220 

TTGTGGGAAA GCAGAACTGA AGTTAGCTTA 5280 

TCATGTATGT GGAATCCAAG AATAACCATA 5340 

ACACTAAAAT GAAACAAATA CCAAGGTATT 5400 

GTCGAGCTGG GGCACACACA CACCTGGCCG 5460 

CTCCTCCACC AAGTCTCTCT AGACAATTCA 5520 

TGGACTGGTG ATTCCAGGGT GCAGAAGGGA 5580 

CCTGCAGGAT AAAGAGATAC CGGTTACATT 5640 

TATTTTTGTT GCTTTTACTT TCATGGTTAG 5700 

CTCCCTTCCC CATTCAGAAA CATTATACAT 5760 

TCATGGATAG TCAGACTGAA CTGTGTGCAA 5820 

ATGAGGTTAC ATGGTTACAT GTTCTACATC 5880 

AAAACTGGAT CAAGATTCTA GCCCACTGGA 5940 

AGATTTGGGT TATATTTTCA ACTTGGGGGA 6000 

TCCAACAAAC ATCATGATAG TCTGGTAGTC 6060 

TGTTTAGCTG AGTTCAGTTA ATACAAAATG 6120 

ATGATTAATG ATCAGTGTGG TGGGAAATGA 6180 

TCTTTATCCC TATTTCTTTA AAACAAAATA 6240 

AAAGCCCCTA TTGGATTCTT CGGATGCGTA 6300 

GGTGAAAACA CATACATCAG ACTATGTTGT 6360 

GCAGGTGAGT GTTGAAACTG TTAAAATTCC 6420 

CTAAATATAG CAAACCCCAT CCAGGTGCTA 6480 

GGTGTAAAGT TTTACATATA CATTAATTTC 6540 

CTCTAGTTTA ATAATGGGGG AAAAAAGATT 6600 

GTAAAATGAT TTTACTGGAT TCTGTTCAAC 6660 

ACAAAGTTGC AGAATTAAAA AAAGAAATCT 6720 

TTTGTATAGT ATCTTTACAT TTTGCAAAAC 6780 
AAAATG 



Seq ID NO: 32 Protein sequence! 
Protein Accession #: BAA92532 



1 11 21 31 41 51 

I I I I I I 

MAVQLVPDSA LGLLMMTEGR RCQVHLLDDR KLELLVQPKL LAKELLDLVA SHFNLKEKEY 60 

FGIAFTDETG HLNWLQLDRR VLEHDFPKKS GPWLYPCVR FYIESISYLK DNATIELFFL 120 

NAKSCIYKEL IDVDSEWFE LASYILQEAK GDFSSNEWR SDLKKLPALP TQALKEHPSL 180 

AYCEDRVIEH YKKLNGQTRG QAIVNYMSIV ESLPTYGVHY YAVKDKQGIP WWLGLSYKGI 240 

FQYDYHDKVK PRKIFQWRQL ENLYFREKKF SVEVHDPRRA SVTRRTFGHS GIAVHTWYAC 300 

PALIKSIWAM AISQHQFYLD RKQSKSKIHA ARSLSEIAID LTETGTLKTS KLANMGSKGK 360 

IISGSSGSLL SSGSQESDSS QSAKKDMLAA LKSRQEALEE TLRQRLEELK KLCLREAELT 420 

GKLPVEYPLD PGEEPPIVRR RIGTAFKLDE QKIIiPKGEEA ELERLEREFA IQSQITEAAR 480 

RLASDPNVSK KLKKQRKTSY LNALKKLQEI ENAINENRIK SGKKPTQRAS LIIDDGNIAS 540 

EDSSLSDALV LEDEDSQVTS TISPLHSPHK GLPPRPPSHN RPPPPQSLEG LRQMHYHRND 600 

YDKSPIKPKM WSESSLDEPY EKVKXRSSHS HSSSHKRFPS TGSCAEAGGG SNSLQNSPIR 660 

GLPHWNSQSS MPSTPDLRVR SPHYVHSTRS VDISPTRLHS LALHFRHRSS SLESQGKLLG 720 

SENDTGSPDF YTPRTRSSNG SDPMDDCSSC TSHSSSEHYY PAQMNANYST LAEDSPSKAR 780 

QRORQRQRAA GALGSASSGS MPNLAARGGA GGAGGAGGGV YLHSQSQPSS QYRIKEYPLY 840 

IEGGATPVW RSLESDQECH YSVKAQFKTS NSYTAGGLFK ESWRGGGGDE GDTGRLTPSR 900 

SQILRTPSLG REGAHDKGAG RAAVSDELRQ WYQRSTASHK EHSRLSHTSS TSSDSGSQYS 960 

TSSQSTFVAH SRVTRMPQMC KATSAALPQS QRSSTPSSEI GATPPSSPHH ILTWQTGEAT 1020 
ENSPILDGSE SPPHQSTDE 

Seq ID NO: 33 Nucleotide sequence : 
Nucleic Acid Accession #: NM_014331 

Coding sequence: 1. .1506 {underlined sequences correspond to start and stop codons) 



'l 11 21 31 41 51 

I I I I I I 

ATgGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 
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CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATT AT AC CCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 

AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 

GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 
CCATCTCCAA AAAAAAAAAA AAAA 



Seq ID NO: 34 Protein sequence: 
Protein Accession #; NPJ)55146 

1 11 21 31 41 51 

I I I I I I 

MVRKPWSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60 
GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120 
LPAFVRVWVE' LLIIRPAATA VISLAFGRYI LEPFFIQCEI PELAIKLITA VGITWMVLN 180 
SMSVSWSARI QIFLTFCKLT AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 240 
YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAIVTIG YVLTNVAYFT TINAEELLLS 300 
NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 360 
RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKC PDMHRP 420 
FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480 
SEKITRTLQI ILEWPEEDK L 



Seq ID NO: 35 Nucleotide sequence: 
Nucleic Acid Accession #: NM_002422 

Coding sequence: 64.. 1497 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ACAAGGAGGC AGGCAAGACA GCAAGGCATA GAGACAACAT AGAGCTAAGT AAAGCCAGTG 60 

GAAATGAAGA GTCTTCCAAT CCTACTGTTG CTGTGCGTGG CAGTTTGCTC AGCCTATCCA 120 

TTGGATGGAG CTGCAAGGGG TGAGGACACC AGCATGAACC TTGTTCAGAA ATATCTAGAA 180 

AACTACTACG ACCTCAAAAA AGATGTGAAA CAGTTTGTTA GGAGAAAGGA CAGTGGTCCT 240 

GTTGTTAAAA AAATCCGAGA AATGCAGAAG TTCCTTGGAT TGGAGGTGAC GGGGAAGCTG 300 
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GACTCCGACA CTCTGGAGGT GATGCGCAAG CCCAGGTGTG GAGTTCCTGA TGTTGGTCAC 360 

TTCAGAACCT TTCCTGGCAT CCCGAAGTGG AGGAAAACCC ACCTTACATA CAGGATTGTG 420 

AATTATACAC CAGATTTGCC AAAAGATGCT GTTGATTCTG CTGTTGAGAA AGCTCTGAAA 480 

GTCTGGGAAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA GGCTGATATA 540 

ATGATCTCTT TTGCAGTTAG AGAACATGGA GACTTTTACC CTTTTGATGG ACCTGGAAAT 600 

GTTTTGGCCC ATGCCTATGC CCCTGGGCCA GGGATTAATG GAGATGCCCA CTTTGATGAT 660 

GATGAACAAT GGACAAAGGA TACAACAGGG ACCAATTTAT TTCTCGTTGC TGCTCATGAA 720 

ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTG AAGCTTTGAT GTACCCACTC 780 

TATCACTCAC TCACAGACCT GACTCGGTTC CGCCTGTCTC AAGATGATAT AAATGGCATT 840 

CAGTCCCTCT ATGGACCTCC CCCTGACTCC CCTGAGACCC CCCTGGTACC CACGGAACCT 900 

GTCCCTCCAG AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTCCTT TGATGCTGTC 960 

AGCACTCTGA GGGGAGAAAT CCTGATCTTT AAAGACAGGC ACTTTTGGCG CAAATCCCTC 1020 

AGGAAGCTTG AACCTGAATT GCATTTGATC TCTTCATTTT GGCCATCTCT TCCTTCAGGC 1080 

GTGGATGCCG CATATGAAGT TACTAGCAAG GACCTCGTTT TCATTTTTAA AGGAAATCAA 1140 

TTCTGGGCCA TCAGAGGAAA TGAGGTACGA GCTGGATACC CAAGAGGCAT CCACACCCTA 1200 

GGTTTCCCTC CAACCGTGAG GAAAATCGAT GCAGCCATTT CTGATAAGGA AAAGAACAAA 1260 

ACATATTTCT TTGTAGAGGA CAAATACTGG AGATTTGATG AGAAGAGAAA TTCCATGGAG 1320 

CCAGGCTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 1380 

GTTTTTGAAG AATTTGGGTT CTTTTATTTC TTTACTGGAT CTTCACAGTT GGAGTTTGAC 144 0 

CCAAATGCAA AGAAAGTGAC ACACACTTTG AAGAGTAACA GCTGGCTTAA TTGTTGAAAG 1500 

AGATATGTAG AAGGCACAAT ATGGGCACTT TAAATGAAGC TAATAATTCT TCACCTAAGT 1560 

CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TGACTCGAGT CACACTCAAG 1620 

GGAACTTGAG CGTGAATCTG TATCTTGCCG GTCATTTTTA TGTTATTACA GGGCATTCAA 1680 

ATGGGCTGCT GCTTAGCTTG CACCTTGTCA CATAGAGTGA TCTTTCCCAA GAGAAGGGGA 1740 

" AGCACTCGTG TGCAACAGAC AAGTGACTGT ATCTGTGTAG ACTATTTGCT TATTTAATAA 1800 
AGACGATTTG TCAGTTGTTT T 



Seq ID NO: 36 Protein sequence: 
Protein Accession #: NP_002413 



1 11 21 31 41 51 

I I I 1 I I 

MKSI1PII1LLL CVAVCSAYPL DGAARGEDTS MNLVQKYLEN YYDLKKDVKQ FVRRKDSGPV 60 

VKKIREMQKF LGLEVTGKLD SDTLEVMRKP RCGVPDVGHF RTFPGIPKWR KTHLTYRIVN 120 

YTPDLPKDAV DSAVEKALKV WEEVTPLTFS RLYEGEADIM ISFAVREHGD FYPFDGPGNV 180 

LAHAYAPGPG INGDAHFDDD EQWTKDTTGT NLFLVAAHEI GHSLGLFHSA NTEALMYPLY 240 

HSLTDLTRFR LSQDDINGIQ SLYGPPPDSP ETPLVPTEPV PPEPGTPANC DPALSFDAVS 300 

TLRGEIIjIFK DRHFWRKSLR KLEPELHLIS SFWPSLPSGV DAAYEVTSKD LVFIFKGNQF 360 

WAIRGNEVRA GYPRGIHTLG FPPTVRKIDA AISDKEKNKT YFFVEDKYWR FDEKRNSMEP 420 
GFPKQIAEDF PGIDSKIDAV FEEFGFFYFF TGSSQLEFDP NAKKVTHTLK SNSWLNC 



Seq ID NO: 37 Nucleotide sequence: 
Nucleic Acid Accession #: NM_003246 

Coding sequence: 112.. 3624 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 .51 

I I I I I I 

GGACGCACAG GCATTCCCCG CGCCCCTCCA. GCCCTCGCCG CCCTCGCCAC CGCTCCCGGC 60 

CGCCGCGCTC CGGTACACAC AGGATCCCTG CTGGGCACCA ACAGCTCCAC CATGGGGCTG 120 

•GCCTGGGGAC TAGGCGTCCT GTTCCTGATG CATGTGTGTG GCACCAACCG CATTCCAGAG 180 

TCTGGCGGAG ACAACAGCGT GTTTGACATC TTTGAACTCA CCGGGGCCGC CCGCAAGGGG 240 

TCTGGGCGCC GACTGGTGAA GGGCCCCGAC CCTTCCAGCC CAGCTTTCCG CATCGAGGAT 300 

GCCAACCTGA TCCCCCCTGT GCCTGATGAC AAGTTCCAAG ACCTGGTGGA TGCTGTGCGG 360 

GCAGAAAAGG GTTTCCTCCT TCTGGCATCC CTGAGGCAGA TGAAGAAGAC CCGGGGCACG 420 

CTGCTGGCCC TGGAGCGGAA AGACCACTCT GGCCAGGTCT TCAGCGTGGT GTCCAATGGC 480 

AAGGCGGGCA CCCTGGACCT CAGCCTGACC GTCCAAGGAA AGCAGCACGT GGTGTCTGTG 540 

GAAGAAGCTC TCCTGGCAAC CGGCCAGTGG AAGAGCATCA CCCTGTTTGT GCAGGAAGAC 600 

AGGGCCCAGC TGTACATCGA CTGTGAAAAG ATGGAGAATG CTGAGTTGGA CGTCCCCATC 660 

CAAAGCGTCT TCACCAGAGA CCTGGCCAGC ATCGCCAGAC TCCGCATCGC AAAGGGGGGC 720 

GTCAATGACA ATTTCCAGGG GGTGCTGCAG AATGTGAGGT TTGTCTTTGG AACCACACCA 780 

GAAGACATCC TCAGGAACAA AGGCTGCTCC AGCTCTACCA GTGTCCTCCT CACCCTTGAC 840 

AACAACGTGG TGAATGGTTC CAGCCCTGCC ATCCGCACTA ACTACATTGG CCACAAGACA 900 

AAGGACTTGC AAGCCATCTG CGGCATCTCC TGTGATGAGC TGTCCAGCAT GGTCCTGGAA 960 

CTCAGGGGCC TGCGCACCAT TGTGACCACG CTGCAGGACA GCATCCGCAA AGTGACTGAA 1020 

GAGAACAAAG AGTTGGCCAA TGAGCTGAGG CGGCCTCCCC TATGCTATCA CAACGGAGTT 1080 

CAGTACAGAA ATAACGAGGA ATGGACTGTT GATAGCTGCA CTGAGTGTCA CTGTCAGAAC 1140 

TCAGTTACCA TCTGCAAAAA GGTGTCCTGC CCCATCATGC CCTGCTCCAA TGCCACAGTT 1200 

CCTGATGGAG AATGCTGTCC TCGCTGTTGG CCCAGCGACT CTGCGGACGA TGGCTGGTCT 1260 

CCATGGTCCG AGTGGACCTC CTGTTCTACG AGCTGTGGCA ATGGAATTCA GCAGCGCGGC 1320 

CGCTCCTGCG ATAGCCTCAA CAACCGATGT GAGGGCTCCT CGGTCCAGAC ACGGACCTGC 1380 

CACATTCAGG AGTGTGACAA AAGATTTAAA CAGGATGGTG GCTGGAGCCA CTGGTCCCCG 1440 

TGGTCATCTT GTTCTGTGAC ATGTGGTGAT GGTGTGATCA CAAGGATCCG GCTCTGCAAC 1500 
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TCTCCCAGCC CCCAGATGAA TGGGAAACCC TGTGAAGGCG AAGCGCGGGA GACCAAAGCC 1560 

TGCAAGAAAG ACGCCTGCCC CATCAATGGA GGCTGGGGTC CTTGGTCACC ATGGGACATC 1620 

TGTTCTGTCA CCTGTGGAGG AGGGGTACAG AAACGTAGTC GTCTCTGCAA CAACCCCGCA 1680 

CCCCAGTTTG GAGGCAAGGA CTGCGTTGGT GATGTAACAG AAAACCAGAT CTGCAACAAG 1740 

5 CAGGACTGTC CAATTGATGG ATGCCTGTCC AATCCCTGCT TTGCCGGCGT GAAGTGTACT 1800 

AGCTACCCTG ATGGCAGCTG GAAATGTGGT GCTTGTCCCC CTGGTTACAG TGGAAATGGC 1860 

ATCCAGTGCA CAGATGTTGA TGAGTGCAAA GAAGTGCCTG ATGCCTGCTT CAACCACAAT 1920 

GGAGAGCACC GGTGTGAGAA CACGGACCCC GGCTACAACT GCCTGCCCTG CCCCCCACGC 1980 

TTCACCGGCT CACAGCCCTT CGGCCAGGGT GTCGAACATG CCACGGCCAA CAAACAGGTG 2040 

10 TGCAAGCCCC GTAACCCCTG CACGGATGGG ACCCACGACT GCAACAAGAA OGCCAAGTGC 2100 

AACTACCTGG GCCACTATAG CGACCCCATG TACCGCTGCG AGTGCAAGCC TGGCTACGCT 2160 

GGCAATGGCA TCATCTGCGG GGAGGACACA GACCTGGATG GCTGGCCCAA TGAGAACCTG 2220 

GTGTGCGTGG CCAATGCGAC TTACCACTGC AAAAAGGATA ATTGCCCCAA CCTTCCCAAC 2280 

TCAGGGCAGG AAGACTATGA CAAGGATGGA ATTGGTGATG CCTGTGATGA TGACGATGAC 2340 

15 AATGATAAAA TTCCAGATGA CAGGGACAAC TGTCCATTCC ATTACAACCC AGCTCAGTAT 2400 

GACTATGACA GAGATGATGT GGGAGACCGC TGTGACAACT GTCCCTACAA CCACAACCCA 2460 

GATCAGGCAG ACACAGACAA CAATGGGGAA GGAGACGCCT GTGCTGCAGA CATTGATGGA 2520 

GACGGTATCC TCAATGAACG GGACAACTGC CAGTACGTCT ACAATGTGGA CCAGAGAGAC 2580 

ACTGATATGG ATGGGGTTGG AGATCAGTGT GACAATTGCC CCTTGGAACA CAATCCGGAT 2640 

20 CAGCTGGACT CTGACTCAGA CCGCATTGGA GATACCTGTG ACAACAATCA GGATATTGAT 2700 

GAAGATGGCC ACCAGAACAA TCTGGACAAC TGTCCCTATG TGCCCAATGC CAACCAGGCT 2760 

GACCATGACA AAGATGGCAA ' GGGAGATGCC TGTGACCACG ATGATGACAA CGATGGCATT 2820 

CCTGATGACA AGGACAACTG CAGACTCGTG CCCAATCCCG ACCAGAAGGA CTCTGACGGC 2880 

GATGGTCGAG GTGATGCCTG CAAAGATGAT TTTGACCATG ACAGTGTGCC AGACATCGAT 2940 

25 GACATCTGTC CTGAGAATGT TGACATCAGT GAGACCGATT TCCGCCGATT CCAGATGATT 3000 

CCTCTGGACC CCAAAGGGAC ATCCCAAAAT GACCCTAACT GGGTTGTACG CCATCAGGGT 3060 

AAAGAACTCG TCCAGACTGT CAACTGTGAT CCTGGACTCG CTGTAGGTTA TGATGAGTTT 3120 

AATGCTGTGG ACTTCAGTGG CACCTTCTTC ATCAACACCG AAAGGGACGA TGACTATGCT 3180 

GGATTTGTCT TTGGCTACCA GTCCAGCAGC CGCTTTTATG TTGTGATGTG GAAGCAAGTC 3240 

30 ACCCAGTCCT ACTGGGACAC CAACCCCACG AGGGCTCAGG GATACTCGGG CCTTTCTGTG 3300 

AAAGTTGTAA ACTCCACCAC AGGGCCTGGC GAGCACCTGC GGAACGCCCT GTGGCACACA 3360 

GGAAACACCC CTGGCCAGGT GCGCACCCTG TGGCATGACC CTCGTCACAT AGGCTGGAAA 3420 

GATTTCACCG CCTACAGATG GCGTCTCAGC CACAGGCCAA AGACGGGTTT CATTAGAGTG 3480 

GTGATGTATG AAGGGAAGAA AATCATGGCT GACTCAGGAC CCATCTATGA TAAAACCTAT 3540 

35 GCTGGTGGTA GACTAGGGTT GTTTGTCTTC TCTCAAGAAA TGGTGTTCTT CTCTGACCTG 3600 

AAATACGAAT GTAGAGATCC CTAAT CATCA AATTGTTGAT TGAAAGACTG ATCATAAACC 3660 

AATGCTGGTA TTGCACCTTC TGGAACTATG GGCTTGAGAA AACCCCCAGG ATCACTTCTC 3720 

CTTGGCTTCC TTCTTTTCTG TGCTTGCATC AGTGTGGACT CCTAGAACGT GCGACCTGCC 3780 

TCAAGAAAAT GCAGTTTTCA AAAACAGACT CATCAGCATT CAGCCTCCAA TGAATAAGAC 3840 

40 ATCTTCCAAG CATATAAACA ATTGCTTTGG TTTCCTTTTG AAAAAGCATC TACTTGCTTC 3900 

AGTTGGGAAG GTGCCCATTC CACTCTGCCT TTGTCACAGA GCAGGGTGCT ATTGTGAGGC 3960 

CATCTCTGAG CAGTGGACTC AAAAGCATTT TCAGGCATGT CAGAGAAGGG AGGACTCACT 4020 

AGAATTAGCA AACAAAACCA CCCTGACATC CTCCTTCAGG AACACGGGGA GCAGAGGCCA 4080 

AAGCACTAAG GGGAGGGCGC ATACCCGAGA CGATTGTATG AAGAAAATAT GGAGGAACTG 4140 

45 TTACATGTTC GGTACTAAGT CATTTTCAGG GGATTGAAAG ACTATTGCTG GATTTCATGA 4200 

TGCTGACTGG CGTTAGCTGA TTAACCCATG TAAATAGGCA CTTAAATAGA AGCAGGAAAG 4260 

GGAGACAAAG ACTGGCTTCT GGACTTCCTC CCTGATCCCC ACCCTTACTC ATCACCTTGC 4320 

AGTGGCCAGA ATTAGGGAAT CAGAATCAAA CCAGTGTAAG GCAGTGCTGG CTGCCATTGC 4380 

CTGGTCACAT TGAAATTGGT GGCTTCATTC TAGATGTAGC TTGTGCAGAT GTAGCAGGAA 4440 

50 AATAGGAAAA CCTACCATCT CAGTGAGCAC CAGCTGCCTC CCAAAGGAGG GGCAGCCGTG 4500 

CTTATATTTT TATGGTTACA ATGGCACAAA ATTATTATCA ACCTAACTAA AACATTCCTT 4560 

TTCTCTTTTT TCCGTAATTA CTAGGTAGTT TTCTAATTCT CTCTTTTGGA AGTATGATTT 4620 

TTTTAAAGTC TTTACGATGT AAAATATTTA TTTTTTACTT ATTCTGGAAG ATCTGGCTGA 4680 

AGGATTATTC ATGGAACAGG AAGAAGCGTA AAGACTATCC ATGTCATCTT TGTTGAGAGT 4740 

55 CTTCGTGACT GTAAGATTGT AAATACAGAT TATTTATTAA CTCTGTTCTG CCTGGAAATT 4800 

TAGGCTTCAT ACGGAAAGTG TTTGAGAGCA AGTAGTTGAC ATTTATCAGC AAATCTCTTG 4860 

CAAGAACAGC ACAAGGAAAA TCAGTCTAAT AAGCTGCTCT GCCCCTTGTG CTCAGAGTGG 4920 

ATGTTATGGG ATTCCTTTTT TCTCTGTTTT ATCTTTTCAA GTGGAATTAG TTGGTTATCC 4980 

ATTTGCAAAT GTTTTAAATT GCAAAGAAAG CCATGAGGTC TTCAATACTG TTTTACCCCA 5040 

60 TCCCTTGTGC ATATTTCCAG GGAGAAGGAA AGCATATACA CTTTTTTCTT TCATTTTTCC 5100 

AAAAGAGAAA AAAATGACAA AAGGTGAAAC TTACATACAA ATATTACCTC ATTTGTTGTG 5160 

TGACTGAGTA AAGAATTTTT GGATCAAGCG GAAAGAGTTT AAGTGTCTAA CAAACTTAAA 5220 

GCTACTGTAG TACCTAAAAA GTCAGTGTTG TACATAGCAT AAAAACTCTG CAGAGAAGTA 5280 

TTCCCAATAA GGAAATAGCA TTGAAATGTT AAATACAATT TCTGAAAGTT ATGTTTTTTT 5340 

65 TCTATCATCT GGTATACCAT TGCTTTATTT TTATAAATTA . TTTTCTCATT GCCATTGGAA 5400 

TAGAATATTC AGATTGTGTA GATATGCTAT TTAAATAATT TATCAGGAAA TACTGCCTGT 5460 

AGAGTTAGTA TTTCTATTTT TATATAATGT TTGCACACTG AATTGAAGAA TTGTTGGTTT 5520 

TTTCTTTTTT TTGTTTTTTT TTTTTTTTTT TTTTTTTTTG CTTTTGACCT CCCATTTTTA 5580 

CTATTTGCCA ATACCTTTTT CTAGGAATGT GCTTTTTTTT GTACACATTT TTATCCATTT 5640 

70 TACATTCTAA AGCAGTGTAA GTTGTATATT ACTGTTTCTT ATGTACAAGG AACAACAATA 5700 
AATCATATGG AAATTTATAT TT 



75 



Seq ID NO: 38 Protein sequence; 
Protein Accession #: NP_003237 
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1 11 21 31 41 51 

I I I I I I 

MGLAWGLGVL FLMHVCGTNR IPESGGDNSV FDIFELTGAA RKGSGRRLVK GPDPSSPAFR 60 

IEDANLIPPV PDDKFQDLVD AVRAEKGFLL LASLRQMKKT RGTLLALERK DHSGQVPSW 120 

SNGKAGTLDL SLTVQGKQHV VSVEEALLAT GQWKSITLFV QEDRAQLYID CEKMENAELD 180 

VPIQSVFTRD LASIARLRIA KGGVNDNFQG VLQNVRFVPG TTPEDILRNK GCSSSTSVLL 240 

TLDNNWNGS SPAIRTNYIG HKTKDLQAIC GISCDEIiSSM VLEIiRGLRTI VTTLQDSIRK 300 

VTEENKELAN ELRRPPLCYH NGVQYRNNEE WTVDSCTECH CONSVTICKK VSCPIMPCSN 360 

ATVPDGECCP RCWPSDSADD GWSPWSEWTS CSTSCGNGIQ QRGRSCDSLN NRCEGSSVQT 420 

RTCHIQECDK RFKQDGGWSH WSPWSSCSVT CGDGVITRIR LCNSPSPQMN GKPCEGEARE 4 80 

TKACKKDACP 1NGGWGPWSP WDICSVTCGG GVQKRSRLCN NPAPQFGGKD CVGDVTENQI 540 

CNKQDCPIDG CLSNPCFAGV KCTSYPDGSW KCGACPPGYS GNGIQCTDVD ECKEVPDACF 600 

NHNGEHRCEN TDPGYNCLPC PPRFTGSQPF GQGVEHATAN KQVCKPRNPC TDGTHDCNKN 660 

AKCNYLGHYS DPMYRCECKP GYAGNGIICG EDTDLDGWPN ENLVCVANAT YHCKKDNCPN 720 

LPNSGQEDYD KDGIGDACDD DDDNDKIPDD RDNCPFHYNP AQYDYDRDDV GDRCDNCPYN 780 

HNPDQADTDN NGEGDACAAD IDGDGILNER DNCQYVYNVD QRDTDMDGVG DQCDNCPLEH 840 

NPDQLDSDSD RIGDTCDNNQ DIDEDGHQNN LDNCPYVPNA NQADHDKDGK GDACDHDDDN 900 

DGIPDDKDNC RLVPNPDQKD SDGDGRGDAC KDDFDHDSVP DIDDICPENV DISBTDFRRF 960 

QMIPLDPKGT SQNDPNWWR HQGKELVQTV NCDPGIAVGY DEFNAVDFSG TFFINTERDD 1020 

DYAGFVFGYQ SSSRFYWMW KQVTQSYWDT NPTRAQGYSG LSVKWNSTT GPGEHLRNAL 1080 

WHTGNTPGQV RTLWHDPRHI GWKDFTAYRW RLSHRPKTGF IRWMYEGKK IMADSGPIYD 1140 
KTYAGGRLGL FVFSQEMVFF SDLKYECRDP 



Seq ID NO: 39 Nucleotide sequence; 
Nucleic Acid Accession #: BC004299 

Coding sequence: 69.. 1235 (underlined sequences correspond to start and stop codonsj 

1 11 21 31 41 51 

I I I I I I 

CCCGACCCGT GCGAGGGCCA GGTCCGCGCC TGCCCCGCCA GGCGAAGCGA GGCGACCCGC 60 

GTGCGGCCAT GGCTTCGCTG CTGGGAGCCT ACCCTTGGCC CGAGGGTCTC GAGTGCCCGG 120 

CCCTGGACGC CGAGCTGTCG GATGGACAAT CGCCGCCGGC CGTCCCCCGG CCCCCGGGGG 180 

ACAAGGGCTC CGAGAGCCGT ATCCGGCGGC CCATGAACGC CTTCATGGTT TGGGCCAAGG 240 

ACGAGAGGAA ACGGCTGGCA GTGCAGAACC CGGACCTGCA CAACGCCGAG CTCAGCAAGA 300 

TGCTGGGAAA GTCGTGGAAG GCGCTGACGC TGTCCCAGAA GAGGCCGTAC GTGGACGAGG 360 

CGGAGCGGCT GCGCCTGCAG CACATGCAGG ACTACCCCAA CTACAAGTAC CGGCCGCGCA 420 

GGAAGAAGCA GGCCAAGCGG CTGTGCAAGC GCGTGGACCC GGGCTTCCTT CTGAGCTCCC 480 

TCTCCCGGGA CCAGAACGCC CTGCCGGAGA AGAGAAGCGG CAGCCGGGGG GCGCTGGGGG 540 

AGAAGGAGGA CAGGGGTGAG TACTCCCCCG GCACTGCCCT GCCCAGCCTC CGGGGCTGCT 600 

ACCACGAGGG GCCGGCTGGT GGTGGCGGCG GCGGCACCCC GAGCAGTGTG GACACGTACC 660 

CGTACGGGCT GCCCACACCT CCTGAAATGT CTCCCCTGGA CGTGCTGGAG CCGGAGCAGA 720 

CCTTCTTCTC CTCCCCCTGC CAGGAGGAGC ATGGCCATCC CCGCCGCATC CCCCACCTGC 780 

CAGGGCACCC GTACTCACCG GAGTACGCCC CAAGCCCTCT CCACTGTAGC CACCCCCTGG 840 

GCTCCCTGGC CCTTGGCCAG TCCCCCGGCG TCTCCATGAT GTCCCCTGTA CCCGGCTGTC . 900 

CCCCATCTCC TGCCTATTAC TCCCCGGCCA CCTACCACCC ACTCCACTCC AACCTCCAAG 960 

CCCACCTGGG CCAGCTTTCC CCGCCTCCTG AGCACCCTGG CTTCGACGCC CTGGATCAAC 1020 

TGAGCCAGGT GGAACTCCTG GGGGACATGG ATCGCAATGA ATTCGACCAG TATTTGAACA 1080 

CTCCTGGCCA CCCAGACTCC GCCACAGGGG CCATGGCCCT CAGTGGGCAT GTTCCGGTCT 1140 

CCCAGGTGAC ACCAACGGGT CCCACAGAGA CCAGCCTCAT CTCCGTCCTG GCTGATGCCA 1200 

CGGCCACGTA CTACAACAGC TACAGTGTGT CATAGAGCTG GAGGCGCCCC GTCCGGTCAG 1260 

CCCTCGCGCC CTCTCCTTCT TGTGCCTTGA GTGGCAGAGG AGCCGTCCAG CCACACCAGC 1320 

TTTCCTCCCA CCGCTCAGGG CAGGGAGGTC TGAACTGCGG CCCCAGAGCC TTTGGCCTAA 1380 

GCTGGACTCT CCTTATCCGA GTGCCGCCTC TATCCCCTTC CCCACGTTCC AGCCCCTGCA 1440 

GCCCACATTT TAAGTATATT CCTTCAAGTG AGTTTTCCTC CAGCCCCTGA GAGTTGCTGT 1500 

CTCCCAGTGG AATGTTCACT GACGTCTTTT CTTGGTAGCC ATCATCGAAA CTAATGGGGG 1560 

GACAGACTTG ATAGCCAAGG TCCCTTCTGG TCCAGTTTTC TGATTTAGGG TTCTCTCAAG 1620 

ATTAATAAAG GAAGATGGGG AAATTTGACT CATTAATGAG CTCGCTAACC TACGATCTGG 1680 

TGATAATTTT GTGTGCACAG CCCAAGGACC ACGAGGCTTT CTGCACTTTC TGCACCCCCT 1740 

TCCAAAGTGA CCACAAAATT TCAAAGGGAC TCATACAATT TGAGAAAAAA CAGTCAACCT 1800 

GATTTGAGAA ATTAACCAGT ATGGCTAACT ATATCACAGA AAATGGGATT GAGTTAAAAC 1860 

TATTTTATTT TAAATATACA TTTTAAAGCA GTTCTTTTTT TTTGTTAATT TGTTTATTAT 1920 

ACACACACTT CAAGAGCCAC CGCGCCCAGC CTACATTTAT AATTTTCATT CTCTTTTACC 1980 

TATAAAATTC AGTGTATTAG TTTCATTACA TAGGAGAAAT TATATTTCTA AACATTTTAT 2040 
GATGTTTAAA AACAAAACAG GCTGTTGTAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 40 Protein sequence^ 
Protein Accession #: AAH04299 

1 11 21 31 41 51 

I I I I I I 

MASIiLGAYPW PEGLECPALD AELSDGQSPP AVPRPPGDKG SESRIRRPMN AFMVWAKDER 60 

KRIaAVQNPDL HNAELSKMLG KSWKALTLSQ KRPYVDEAER LRLQHMQDYP NYKYRPRRKK 120 

QAKRLCKRVD PGFLLSSLSR DQNALPEKRS GSRGALGEKE DRGEYSPGTA LPSLRGCYHE 180 
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GPAGGGGGGT PSSVDTYPYG LPTPPEMSPI* DVLEPEQTFF SSPCQEEHGH PRRIPHLPGH 240 

PYSPEYAPSP LHCSHPLGSL ALGQSPGVSM MSPVPGCPPS PAYYSPATYH PLHSNLQAHL 300 

GQLSPPPEHP GFDALDQLSQ VELLGDMDRN EFDQYLNTPG HPDSATGAMA LSGHVPVSQV 360 
TPTGPTETSL ISVLADATAT YYNSYSVS 



Seq ID NO: 41 Nucleotide sequence: 
Nucleic Acid Accession #; NM_004449 

Coding sequence: 1..1389 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



ATGATTCAGA 
GAGGACCAGT 
ACCGCGTCCT 
CAGGATTGGC 
CAGGTGAATG 
GTGGGCAGCC 
CCACCCCCAA 
TGGAGTACAG 
GACGTCAACA 
GACGACTTCC 
TACCTCAGAG 
AACTCTCCAC 
TCAGCCTGGA 
TCCACAGTGC 
CCAACAAGTA 
CTGGAGCTCC 
GAGTTCAAGA 
CCCAACATGA 
ATGACCAAGG 
GCCCTCCAGC 
ATGGGCTCCT 
CTCCCCGTGA 
GGGGGTATAT 
TACTACTAA 



CTGTCCCGGA 
CGTTGTTTGA 
CCTCCAGCGA 
TGTCTCAACC 
GCTCAAGGAA 
CAGACACCGT 
ACATGACCAC 
ACCATGTGCG 
TCTTGTTATT 
AGAGGCTCAC 
AGACTCCTCT 
GGTTAATGCA 
CCGGTCACGG 
CCAAAACTGA 
GCCGCCTTGC 
TGTCGGACAG 
TGACGGATCC 
ACTACGATAA 
TCCATGGGAA 
CCCACCCCCC 
ATCACGCCCA 
CATCTTCCAG 
ACCCCAACAC 



CCCAGCAGCT 
GTGTGCCTAC 
CTATGGACAG 
CCCAGCCAGG 
CTCTCCTGAT 
TGGGATGAAC 
GAACGAGCGC 
GCAGTGGCTG 
CCAGAACATC 
CCCCAGCTAC 
TCCACATTTG 
TGCTAGAAAC 
CCACCCCACG 
AGACCAGCGT 
AAATCCAGGC 
CTCCAACTCC 
CGACGAGGTG 
GCTCAGCCGC 
GCGCTACGCC 
GGAGTCATCT 
CCCACAGAAG 
TTTTTTTGCT 
TAGGCTCCCC 



CATATCAAGG 
GGAACGCCAC 
ACTTCCAAGA 
GTCACCATCA 
GAATGCAGTG 
TACGGCAGCT 
AGAGTTATCG 
GAGTGGGCGG 
GATGGGAAGG 
AACGCCGACA 
ACTTCAGATG 
ACAGATTTAC 
CCCCAGTCGA 
CCTCAGTTAG 
AGTGGCCAGA 
AGCTGCATCA 
GCCCGGCGCT 
GCCCTCCGTT 
TACAAGTTCG 
CTGTACAAGT 
ATGAACTTTG 
GCCCCAAACC 
ACCAGCCATA 



AAGCCTTATC 
ACCTGGCTAA 
TGAGCCCACG 
AAATGGAATG 
TGGCCAAAGG 
ACATGGAGGA 
TGCCAGCAGA 
TGAAAGAATA 
AACTGTGCAA 
TCCTTCTCTC 
ATGTTGATAA 
CATATGAGCC 
AAGCTGCTCA 
ATCCTTATCA 
TCCAGCTTTG 
CCTGGGAAGG 
GGGGAGAGCG 
ACTACTATGA 
ACTTCCACGG 
ACCCCTCAGA 
TGGCGCCCCA 
CATACTGGAA 
TGCCTTCTCA 



AGTTGTGAGT 
GACAGAGATG 
CGTCCCTCAG 
TAACCCTAGC 
CGGGAAGATG 
GAAGCACATG 
TCCTACGCTA 
TGGCCTTCCA 
GATGACCAAG 
ACATCTCCAC 
AGCCTTACAA 
CCCCAGGAGA 
ACCATCTCCT 
GATTCTTGGA 
GCAGTTCCTC 
CACCAACGGG 
GAAGAGCAAA 
CAAGAACATC 
GATCGCCCAG 
CCTCCCGTAC 
CCCTCCAGCC 
TTCACCAACT 
TCTGGGCACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 42 Protein sequence: 
Protein Accession #: NP 004440 



1 
I 

MIQTVPDPAA 
QDWLSQPPAR 
PPPNMTTNER 
DDFQRLTPSY 
SAWTGHGHPT 
LELLSDSSNS 
MTKVHGKRYA 
LPVTSSSFFA 



11 
I 

HIKEALSWS 
VTIKMECNPS 
RVIVPADPTL 
NADILLSHLH 
PQSKAAQPSP 
SCITWEGTNG 
YKFDFHGIAQ 
APNPYWNSPT 



21 
I 

EDQSLFECAY 
QVNGSRNSPD 
WSTDHVRQWL 
YLRETPLPHL 
STVPKTEDQR 
EFKMTDPDEV 
ALQPHPPESS 
GGIYPNTRLP 



31 
I 

GTPHIAKTEM 
ECSVAKGGKM 
EWAVKEYGLP 
TSDDVDKALQ 
PQLDPYQILG 
ARRWGERKSK 
LYKYPSDIjPY 
TSHMPSHLGT 



41 
I 

TASSSSDYGQ 
VGSPDTVGMN 
DVNILLFQNI 
NSPRLMHARN 
PTSSRLANPG 
PNMNYDKLSR 
MGSYHAHPQK 
YY 



51 
I 

TSKMSPRVPQ 
YGSYMEEKHM 
DGKEIjCKMTK 
TDLPYEPPRR 
SGQIQLWQFL 
ALRYYYDKNI 
MNFVAPHPPA 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 43 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005100 

Coding sequence: 192.. 5537 (underlined sequences correspond to start and stop codons) 



CCTTCTTTTA 
CAGTCGGCTG 
CTTGCCCCTG 
GCGGAGGAGC 
AGGGGAGCTC 
CGCCAGACAC 
AGAAGAATGG 
AGGAGGGTGA 
AGGAGGAAGA 
AAAGAGACTC 
ATGGGCAGGA 
AGCTAACACA 
TTGTTGGCTT 



11 
I 

AGGAGTTTGC 
GCGGCGAAGG 
TCCCTGCGGC 
CA^GGGCGCC 
CACGCCGGCT 
CACCGCGGAC 
TCAGCTGTCC 
CCTAAATGGC 
AGAAGTCATT 
CGATAAAGAG 
GGAGAACCGA 
ACCCACTGAG 
TAAATTCACT 



21 
I 

CGCGAGCGCG 
AAGGCGCTCT 
TTGGGGAAAG 
GGGAGCTCCA 
GAGCCCGAGC 
CCCGCCATCG 
ACCATCAATG 
CAGAAAGGAG 
GTCACGGAGG 
ATGGCTACTA 
AATAT CGAAC 
TCCCAGGCTA 
GTGAAAAAGG 



31 
I 

TCTCCTTCAT 
CGGGACCTCA 
CGTAACCCGG 
CCGAGCAGCG 
CCAGCGGCGG 
CTGCCTCGGA 
GCGTAGCTGA 
CCCTGAACGG 
TTGGACAGAG 
AGTCAGCGGT 
AGATTCCTTC 
ATGATATTGG 
ATAAGACAGA 



41 
I 

TCGCAGGCTG 
CGGGCGCGCG 
CGGCTAGGCG 
CAGCCCGGAG 
CGGCCCCTCG 
CCCCGCCACC 
GCAAGATGAG 
TCAAGGAGCC 
AGACTCTGAA 
TGTTCACGAC 
TTCAGAAAGC 
ATTTAAGAAG 
GAAGCCTGAC 



51 
I 

GGCGCGTTOG 
TCTTTTGGCT 
CGGGAGAAGT 
CAGCCGCCCG 
GCCGAGGCGG 
AAGCTCCTAC 
CTCAGCCTCC 
CTAAACAGCC 
GATGTGAGCG 
ATCACAGATG 
AATTTAGAAG 
GTGTTTAAGT 
ACTGTCCAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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TACTCACTGT GAAGAAAGAT GAAGGGGAGG 
CCAGCCTTGG GGCTGGAGAA GCAGCATCCA 
AACCCGAAGA GACCCTGAAG CGTGAGCAAA 
CTGGCCAAGC AGTGGAGGAA TGCAAAGAGG 
5 GCAAGTCTGC AGAATCTCCG ACTAGTCCCG 
AATTCTTCAC TCAAGGTTGG GCCGGCTGGC 
AGGATGAAGT GGAAGCTTCA GAGAAGAAAA 
AAGAAGACGG AAAGGCAGAG GTTGCCTCCG 
CACAGGAGCC GGCAGAAAGT GCCCACGAGC 

10 AGCTGCCCTC AGAGGAGCAA GTCAGTGGCT 
CGTTGGCGAC AGAAGTGTTT GATGAGAAAA 
AAGTCCACGT CAGCACCGTG GAGGAGAGAA 
CAGCAGGGTC TGTGCCAGCT GAAGAATTGG 
AACCTGCCAA GGAGCTGGTG AAGCTCAAAG 

15 AGGGAGCTGA CCTCAGTCCT GATGAGAAGG 
GTGAGGTGGA AATGCTGTCA TCACAGGAGA 
AGCTTTTTAC CAGCACTGGC TTAAAAAAGC 
GAGGAGGAGA CGAGGAATCA GGGGAGCACA 
AGGAGGAGCA AAAGGGCGAG AGCTCTGCCT 

20 GTCTGGAAAA GGGCTTAGCC GAGGTGCAGC 
CCGATGGAGA GAAAAAAAGA GAAGGTGTCA 
CGCCCAAGAA GCGTGTTAGA CGGCCTTCGG 
TCAAGAGCGC TACCTTGTCT TCCACCGAGA 
AAGGGAGCGT GGAAGAGCCA AAGCCGGAAG 

25 . CTTGGGAAGC TTTAATTTGT GTGGGATCAT 
CTGATGAGGA AGGGGGACCA AAAGCAATGG 
GAAAAGACAA AGAGACGGGG ACAGACGGGA 
GGCAGGGAAG TTCCTCCCCG GAGCAAGCTG 
CCTGGGAGTC ATTTAAAAGG TTAGTCACGC 

30 AGAAAAGCGA AGACTCCATA GCTGGGTCTG 
' CCGGTAAAGA AGAATCCTGG GTCTCAATCA 
GGCCAGATGG GAAACAAGAA CAAGCCCCTG 
AAGATGACTC TGATGTCCCG GCCGTGGTCC 
AGAAAATGGA GGCACAGCAA GCCCAAAAAG 

35 CTGAGGTGTC CAAGGAGCTC AGCGAGAGTC 
ACGGGACGAG GGCAGCTACC ATTATTGAAG 
TGACAGAACC TCTTGAACAA GTAGAAGCTG 
AAAGAGAAGT AATTGCAGAA GAAGAACCCC 
GAGAGGCCCG GGGCGACACG GTCGTTAGTG 

40 CTGCAGAAAC TGCAGGGCCA TTGGGTTCCG 
AGACCACAGA AATGGTGTCA GCAGTCTCCC 
AGGCCACTCC GGTGCAGGAG GTGGAAGGTG 
GGACTCAAGA GGTCCTCCAG GCAGTGGCAG 
GCACCGGTGG GCCAGAAGAT GTGCTTCAGC 

45 AAGAGCAGGC TGAAGCGTCG GGTCTGAAGA 
CTCAGGAGGC AAAAACTGAG CCTTTTACAC 
AAAGCTTTGA AAAAGCTCCT CAAGTCACAG 
CTTGTCAAGC CGAAACCTTA GCTGGGGTAA 
TCCCCCCTGA CTCGGTGGAA ACCCCTACAG 

50 CCGACTTTGA CGCACCAGGC ACAACCCAGA 
ATGAGGTCGC ATCTGGTACC CAGTCAGGGG 
AAGAGAGGCC TCCAGCACCT TCCAGTTTTG 
AGATGGAAGA CACTCTAGAG CATACAGATA 
TGTCAAAGAC TGAGGGGACT CAAGAGGCTG 

55 TACCATTTTT CGAAGGACTT GAGGGGTCTA 
AGGTCACTGA AGTTGCCCTT AAAGGTGAAG 
ATGCTCTTGA ACTGCAGAGT CACGCTAAGT 
TAGTTCAAGT CGAAAGGGAG AAAACAGAAG 
TTGAGCACGA AACAGCTGTT ACCGTATCTG 

60 TGAATGTGCC CATCATAGAT GGGGCAAAGG 
CCTGCCTAGG TCAAGAGGAG GCAGTATGCA 
CATTCACTCT AACAGCGGCT GCAGAGGAGG 
TAGAAACAGG TGAAACGTTG GAGCCTGCAG 
CTGAAAAAAA TGAAGACTTT GCCGCTCATC 

65 ACTGTCAGGC AAAATCGACA CCAGTGATAG 
CCGACCTGGA AGGAGAGAAA ACCACATCAC 
AGGTTGCTTG CCAGGAGGTC AAAGTGAGTG 
GGATTTTGGA ACTTGAGACC AAAAGCAGTA 
TTGACCAGTT TGTACGTACA GAAGAAACAG 

70 CACAAGCTCA CGTGATAAAA GCTGACAGCC 
GAGAGGAACC TCAGGCCTCT GCACAGGATG 
CAGAGTCAAC CGCAGTGGGA CAAGCACATT 
CAGAAAAGAC CATGACTGTT GAGGTAGAAG 
AGGTCGTCCT CCCATCTGAG GAAGAGGGAG 

75 ATGATGGTCA TGCCTTGTTA GCAGAAAGAA 
ATGAAAAAGG TGATGATGTT GATGACCCTG 



GAGCAGCAGG GGCTGGCGAC CACCAGGACC 840 

AAGAAAGCGA ACCCAAACAA TCTACAGAGA 900 

GCCACGCAGA AATTTCTCCC CCAGCCGAAT 960 

AAGGAGAAGA GAAACAAGAA AAAGAACCTA 1020 

TGACCAGTGA AACAGGATCA ACCTTCAAAA 1080 

GCAAAAAGAC CAGTTTCAGG AAGCCGAAGG 1140 

AGGAACAAGA GCCAGAAAAA GTAGACACAG 1200 

AGAAACTGAC CGCCTCCGAG CAAGCCCACC 1260 

CCCGGTTATC AGCTGAATAT GAGAAAGTTG 1320 

CGCAGGGACC TTCTGAAGAG AAACCTGCTC 1380 

TAGAAGTCCA CCAAGAAGAG GTTGTGGCCG 1440 

CCGAAGAGCA GAAAACGGAG GTGGAAGAAA 1500 

TTGGAATGGA TGCAGAACCT CAGGAAGCCG 1560 

AAACGTGTGT TTCCGGAGAG GACCCTACAC 1620 

TGCTGTCCAA ACCCCCCGAA GGCGTTGTGA 1680 

GAATGAAGGT GCAGGGAAGT CCACTAAAGA 1740 

TTTCTGGAAA GAAACAGAAA GGGAAAAGAG 1800 

CTCAGGTTCC AGCCGATTCT CCGGACAGCC 1860 

CATCCCCTGA GGAGCCCGAG GAGATCACGT 1920 

AGGATGGGGA AGCTGAAGAA GGAGCTACTT 1980 

CTCCCTGGGC ATCATTCAAA AAGATGGTGA 2040 

AAAGTGATAA AGAAGATGAG CTGGACAAGG 2100 

GCACAGCCTC TGAAATGCAA GAAGAAATGA 2160 

AACCAAAGCG CAAGGTGGAT ACCTCAGTAT 2220 

CCAAGAAAAG AGCAAGGAGA AGGTCCTCTT 2280 

GAGGAGACCA CCAGAAAGCT GATGAGGCCG 2340 

TCCTTGCTGG TTCCCAAGAA CATGATCCAG 2400 

GAAGCCCTAC CGAAGGGGAG GGCGTTTCCA 2460 

CAAGAAAAAA ATCAAAGTCC AAGCTGGAAG 2520 

GTGTAGAACA TTCCACTCCA GACACTGAAC 2580 

AGAAGTTTAT TCCTGGACGA AGGAAGAAAA 2640 

TTGAAGACGC AGGGCCAACA GGGGCCAACG 2700 

CTCTGTCTGA GTATGATGCT GTAGAAAGGG 2760 

GCGCAGAGCA GCCCGAGCAG AAGGCAGCCA 2820 

AGGTTCATAT GATGGCAGCA GCTGTCGCTG 2880 

AAAGGTCTCC TTCTTGGATA TCTGCTTCAG 2940 

AAGCCGCACT GTTAACTGAG GAGGTATTGG 3000 

CCACGGTTAC TGAACCTCTG CCAGAGAACA 3060 

AGGCGGAATT GACCCCCGAA GCTGTGACAG 3120 

AAGAAGGAAC CGAAGCATCT GCTGCTGAAG 3180 

AGTTAACCGA CTCCCCAGAC ACCACAGAGG 3240 

GCGTACCTGA CATAGAAGAG CAAGAGAGGC 3300 

AAAAAGTGAA AGAGGAATCC CAGCTGCCTG 3360 

CTGTGCAGAG AGCAGAGGCA GAAAGACCAG 3420 

AAGAGACGGA TGTAGTGTTG AAAGTAGATG 3480 

AAGGGAAGGT GGTGGGGCAG ACCACCCCAG 3540 

AGAGCATAGA GTCCAGTGAG CTTGTAACCA 3600 

AATCACAGGA GATGGTGATG GAACAGGCTA 3660 

ACAGTGAGAC TGATGGAAGC ACCCCCGTAG 3720 

AAGACGAGAT TGTGGAAATC CATGAGGAGA 3780 

GCACAGAAGC AGAGGCAGTT CCTGCACAGA 3840 

TGTTCCAGGA AGAAACTAAA GAACAATCAA 3900 

AAGAGGTGTC AGTGGAAACT GTATCCATTC 3960 

ACCAGTATGC TGATGAGAAA ACCAAAGACG 4020 

TAGACACAGG CATAACAGTC AGTCGGGAAA 4080 

GGACAGAAGA AGCTGAATGT AAAAAGGATG 4140 

CTCCTCCATC CCCCGTGGAG AGAGAGATGG 4200 

CAGAGCCAAC CCATGTGAAT GAAGAGAAGC 4260 

AAGAGGTCAG TAAGCAGCTC CTCCAGACAG 4320 

AAGTCAGCAG TTTGGAAGGA AGCCCTCCTC 4380 

CCAAAATTCA AGTTCAGAGC TCTGAGGCAT 4440 

AAAAGGTCTT AGGAGAAACT GCCAACATTT 4500 

GTGCACATTT AGTTCTGGAA GAGAAATCCT 4560 

CAGGGGAAGA TGCTGTGCCC ACAGGGCCCG 4620 

TATCTGCTAC TACCAAGAAA GGCTTAAGTT 4680 

TGAAGTGGAA GTCAGATGAA GTCGATGAGC 4740 

TAGCAATTGA GGATTTAGAG CCTGAAAATG 4800 

AACTTGTCCA AAACATCATC CAGACAGCCG 4860 

CCACCGAAAT GTTGACGTCT GAGTTACAGA 4920 

AGGACGCTGG ACAGGAAACG GAGAAAGAAG 4980 

AAACACCAAT TACTTCAGCC AAAGAGGAGT 5040 

CTGATATTTC CAAAGACATG AGTGAAGCCT 5100 

GTTCCACTGT AAATGATCAG CAGCTGGAAG 5160 

GTGGAGCTGG AACAAAGTCT GTGCCAGAAG 5220 

TAGAGAAGTC ACTAGTTGAA CCGAAAGAAG 5280 

AAAACCAGAA CTCAGCCCTG GCTGATACTG 5340 
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10 



15 



20 



25 



ATGCCTCAGG 
AGGAGGATGC 
CGATCACACC 
AACTTACAGA 
TGTGAAGACA 
AGAACTTTGA 
CAATCCTGAG 
CTACAATTTT 
GAACTGGAGT 
TTATATGTAT 
AGGAAATGTG 
CATAGTGCAG 
CCTAGATGTA 
CATATTTAGT 
AAAAGCATTT 
CAGTGGAGGT 
ACTTGTTATG 
GTTTTATA7G 
GACTGCATTC 
TACTGGATTG 
TCATTTGGAT 
AAAAAAAA 



AGGCTTAACC 
CCAGGAAGTA 
CCAAGCACAG 
ATCTTAAAAC 
AGTAGTAGAA 
GAATTGGAGA 
GCTTCATCAG 
CCCTTGATAA 
TGGCAATACC 
GTTTTAAGTA 
CAGGATACTA 
GCTTGGGGAG 
ACATTCCTGA 
GGTACTCTGA 
TGAAACATAC 
TAGAAAGAAG 
GTTGTTTGGA 
TTATGAAGAA 
ACACATGGCA 
TGTTTGTGCC 
AAGTTGTGAT 



AAAGAGTCCC 
GAATTGCAGG 
GAGGAGTTAC 
ATCATGCAGT 
GAAAATGAAT 
GCAGGCACAT 
GAGCTAGAGC 
CCATATAAAT 
TAGTTCTGCT 
GTCCTCCTGT 
CATGCTTTTT 
CTTTAAGCCT 
TCAAGGTACA 
AATTGGTCAC 
AGAATGTTCT 
TTATATTCTG 
CCGATAAGTG 
AAGAATTGTT 
TGAAATAAGT 
ATATTTGTGC 
TTGACGACTG 



CAGATACAAA 
AAGGAAAAGT 
AGAAACAAGA 
TAAACTCATT 
GCTGCTGCTG 
CAACTGATCT 
CATTTAACAT 
TCTGATTTAA 
TCTGAAACTG 
ATCTATTGTA 
GTATCACACA 
CAGTTATATA 
ATTCTTTAAA 
TTTCCTATTA 
ATTGTCATTG 
GTAGCAAATT 
TGCTTAATCC 
GTAAGTTTTT 
CAGGTTCTTT 
CATTCCTTTA 
ATTTAAATAA 



TGGACCAAAA 
GCACAGTGAA 
GAGAGAATCT 
GTCTGTTTGG 
AGACTGAAGA 
CATTTCTAGA 
TTCCTCTTTC 
GGTCCTAAAT 
GAGTATCATT 
TATTTTTTTC 
GTATATGATG 
ACCCACAAAA 
ATTCACTAAT 
CACGGAGTGT 
GGAAATTTTG 
AACTTTACAT 
TGAGGCAAAG 
GATTCTACTC 
ACAAATGGTA 
AGAACAATGT 
AATATTTGCT 



CAAAAAGAGA 
TCAGATAAAG 
GCAAAGTCAG 
AAGACCAGAA 
CCAGTATTTC 
GAGCCCCTGA 
CAAGACCAAC 
TCTTAACCTG 
CTTTACATAT 
TTAATGTTTA 
GGGCATGTGC 
AACAGAGCCT 
GATTGAGGTC 
GCCAAAACTA 
CTTTCTAACC 
CCTTTTTCCT 
TAGTGAATAT 
TTATATGCTG 
TTTTGATAGA 
TGCAACACAT 
TCACTTAAAA 



5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 



Seq ID NO: 44 Protein sequence; 
Protein Accession #: NP_005091 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



i 
I 

MGAGSSTEQR 
OLSTINGVAE 
DKEMATKSAV 
KFTVKKDKTE 
TLKREQSHAE 
QGWAGWRKKT 
AESAHEPRLS 
STVEERTEEQ 
LSPDEKVLSK 
EESGEHTQVP 
KKREGVTPWA 
EEPKPEEPKR 
ETGTDGILAG 
DSIAGSGVEH 
DVPAWPLSE 
AATIIEERSP 
GDTWSEAEL 
VQEVEGGVPD 
EASGLKKETD 
ETLAGVKSQE 
SGTQSGGTEA 
EGTQEADQYA 
LQSHAKSPPS 
IIDGAKEVSS 
ETLEPAGAHIi 
GEKTTSLKWK 
VRTEETATEM 
AVGQAHSDIS 
ALLAERIEKS 
QEVELQEGKV 



11 
I 

SPEQPPEGSS 
QDELSLQEGD 
VHDITDDGQE 
KPDTVQLLTV 
ISPPAESGQA 
SFRKPKEDEV 
AEYEKVELPS 
KTEVEETAGS 
PPEGWSEVE 
ADSPDSQEEQ 
SFKKMVTPKK 
KVDTSVSWEA 
SQEHDPGQGS 
STPDTEPGKE 
YDAVEREKME 
SWISASVTEP 
TPEAVTAAET 
IEEQERRTQE 
WLKVDAQEA 
MVMEQAIPPD 
EAVPAQKERP 
DEKTKDVPFF 
PVEREMWQV 
LEGSPPPCLG 
VLEEKSSEKN 
SDEVDEQVAC 
LTSELQTQAH 
KDMSEASEKT 
LVEPKEDEKG 
HSESDKAXTP 



21 
I 

TPAEPEPSGG 
LNGQKGALNG 
ENRNIEQIPS 
KKDEGEGAAG 
VEECKEEGEE 
EASEKKKEQE 
EEQVSGSQGP 
VPAEELVGMD 
MLSSQERMKV 
KGESSASSPE 
RVRRPSESDK 
LICVGSSKKR 
SSPEQAGSPT 
ESWVSIKKFI 
AQQAQKGAEQ 
LEQVEAEAAIi 
AGPLGSEEGT 
VLQAVAEKVK 
KTEPFTQGKV 
SVETPTDSET 
PAPSSFVFQE 
EGLEGSIDTG 
EREKTEAEPT 
QEEAVCTKIQ 
EDFAAHPGED 
QEVKVSVAIE 
ViKADSQDAG 
MTVEVEGSTV 
DDVDDPENQN 
QAQEELQKQE 



31 
I 

GPSAEAAPDT 
QGALNSO.EEE 
SESNLEELTQ 
AGDHQDPSLG 
KQEKEPSKSA 
PEKVDTEEDG 
SEEKPAPLAT 
AEPQEAEPAK 
QGSPLKKLFT 
EPEEITCLEK 
EDELDKVKSA 
ARRRSS5DEE 
EGEGVSTWES 
PGRRKKRPDG 
PEQKAATEVS 
LTEEVLEREV 
EASAAEETTE 
EESQLPGTGG 
VGQTTPESFE 
DGSTPVADFD 
ETKEQSKMED 
ITVSREKVTE 
HVNEEKLEHE 
VQSSEASFTL 
AVPTGPDCQA 
DLEPENGILE 
QETEKEGEEP 
NDQQLEEWL 
SALADTDASG 
RESAKSELTE 



41 
I 

TADPAIAASD 
EVIVTEVGQR 
PTESQANDIG 
AGEAASKESE 
ESPTSPVTSE 
KAEVASEKLT 
EVFDEKIEVH 
ELVKLKETCV 
STGLKKLSGK 
GLAEVQQDGE 
TLSSTESTAS 
GGPKAMGGDH 
FKRLVTPRKK 
KQEQAPVEDA 
KELSESQVHM 
IAEEEPPTVT 
MVSAVSQLTD 
PEDVLQPVQR 
KAPQVTESIE 
APGTTQKDEI 
TLEHTDKEVS 
VALKGEGTEE 
TAVTVSEEVS 
TAAAEEEKVL 
KSTPVIVSAT 
LETKSSKLVQ 
QASAQDETPI 
PSEEEGGGAG 
GLTKESPDTN 
S 



51 

I 

PATKLLQKNG 
DSEDVSERDS 
FKKVFKFVGF 
PKQSTEKPEE 
TGSTFKKFFT 
ASEQAHPQEP 
QEEWAEVHV 
SGEDPTQGAD 
KQKGKRGGGD 
AEEGATSDGE 
EMQEEMKGSV 
QKADEAGKDK 
SKSKLEEKSE 
GPTGANEDDS 
MAAAVADGTR 
EPLPENREAR 
SPDTTEEATP 
AEAERPEEQA 
SSELVTTCQA 
VEIHEENEVA 
VETVSILSKT 
AECKKDDALE 
KQLLQTVNVP 
GETANILETG 
TKKGLSSDLE 
NIIQTAVDQF 
TSAKEESEST 
TKSVPEDDGH 
GPKQKEKEDA 



60 
120 
180 
.240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 45 Nucleotide sequence: 
Nucleic Acid Accession #: NM_0 01290 

Coding sequence: 110 1231 (underlined sequences correspond to start and stop codons) 



1 
I 

GTGAGCGTGT 
TGGACGTTAC 
ACCACATGAC 
CATGGTACAG 
GGATAGTGAC 
ATTAACCCTT 
CATCCCCCGT 
CAAACACTCG 
CATGGTCACC 



11 
I 

GTGCGTGCGT 
TGATACTCTG 
CCCTTCTATT 
CCAGAGTACC 
AACCTCTGGT 
TCATTTTGTT 
TACTTTAGCA 
AAAGAGTCAT 
CAGCACGGGA 



21 
I 

CTACTTTGTA 
TTTAGCTTGA 
CTTCTCCTTT 
GAATCTATGA 
GGGACGCCTT 
TGGAAGATGG 
CTGTGTTTGA 
ACCACAACTC 
AGCCCATGTT 



31 
I 

CTGGGAAGAA 
TTTTCGAAAA 
CGGCCCATTT 
GATGAACAAG 
TGCCACTGAA 
ACCAAAGCGA 
AGGAGGGGTG 
ATCCATCACG 
TACCAAGGTA 



41 
I 

CACAGCCCAT 
GCAGGCAAGA 
TATAGGAGGC 
AGACTGCAGT 
TTTTTTGAAG 
TACACTATCG 
ACCGACCTGT 
GTGGACTGCG 
TGTACAGAAG 



51 
I 

GTGCTCTGCA 60 

TGTCCAGCAC 120 

ATACACCATA 180 

CTCGCACAGA 240 

ATGACGCCAC 300 

GCAGGACCCT 360 

ATTACATTCT 420 

ACCAGTGTAC 480 

GCAGACTGAT 540 
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CTTGGAGTTC ACCTTTGATG ATCTCATGAG 
ATACCGAGAG TTAGTCCCGA GAAGCATCCT 
GGATCAGCTG TCCAAAAACA TCACCAGGAT 
CAGGTTGTGT GTAATATTGG AGCCAATGCA 
CCTCAGTCCC CGAGACTGCC TGAAGACCTG 
TCCGCCAGCA GAACCCACAA GGCAACCAAC 
CAGCAGCACT TCCAACAGCA GCGCTGGGAA 
GACCACAGCT GCAAACCTGA GTCTGTCCAG 
GCCAACTCTG ATGGGAGGTG AGTTTGGGGA 
AAACAOGCAA TATGATGCGG CCAACGGCAT 
CGCGCTGGGG AACAACAGCC CGTGGAACAG 
AGAAAACCCC CCACCCCAGG CTTCCCAATA 
AGGCCCGTGG GTGATCATTA CAATTGCAAA 
ATAAAAACTT TTCCATGCAA ATATCTATTT 
CTTTCTTTTT TTCTAATTGA GAGGATTATT 
GGCCTTCACA GGTAATACAG ATACTGGCAC 
GCGCATCTTC TGGCACGGTT TTAACAACGT 
AACGAAGGCC ATATTGTCCA TAAATGCTCA 
TAACTACAGA TGACTTTTTA ATATTGTAAA 
AGTTTCTTGT TTCAGTAAAA AAAGAAAAGA 
ATGTACCTTA TTTTTTTTTT CTTTATGTTT 
AGCAAGGTAA TTTATGGTTG AGCTGATGTC 
TAGCCCAAGT GCTGAAACAA GAAATGTCAT 
TTAAGTAAAG AAAGACAATT GGACCCTTAA 
TCCAAATATT TTCAAGCCAT GTAATCCATT 
CTTTGTGTGT TTTCTAATTG TACCTGAGTT 
TGTATGATAT TTTGTAAAGC TCTCACCTGG 
AACTCCAGTG TATTTATGTG AAACTTTATA 
TATGTTCCTC CACACATGTA AAGGCACAGT 
TATGTATGCT TTACTGATAA GTGTGCCAAT 



AATCAAAACA TGGCACTTTA CCATTAGACA 600 

AGCCATGCAT GCACAAGATC CTCAGGTCCT 660 

GGGGCTAACA AACTTCACCC TCAACTACCT 720 

GGAACTGATG TCGAGACATA AAACTTACAA 780 

CTTGTTTCAG AAGTGGCAGA GGATGGTGGC 840 

AACCAAACGG AGAAAAAGGA AAAATTCCAC 900 

CAATGCAAAC AGCACTGGCA GCAAGAAGAA 960 

TCAGGTACCT GATGTGATGG TGGTAGGAGA 1020 

CGAGGACGAA AGGCTAATCA CTAGATTAGA 1080 

GGACGACGAG GAGGACTTCA ACAATTCACC 1140 

TAAACCTCCC GCCACTCAAG AGACCAAATC 1200 

AGATGATCGG CACCAGAATC CACTGTCAAT 1260 

TCTTTACTTA CAGGAGAGGA AACAGAAGAG 1320 

CTAAACCACA ATGATCTGAT TTTCTTTCTT 1380 

CCCAGTAAGC TTCCATGACC CTTTCTTGGA 1440 

TGATTGTAAT TAAAATGAGA GAAAACTCTA 1500 

GTTTGTGTTG AATTTCCTTT TTATGCATCA 1560 

GTGCTCAGGA TCTCATTAAT ATGCCGAACC 1620 

ATATTTTCTG CTTTTTGACT TGCATCTGAG 1680 

CAAAAAAATC AGCTTTGGAA AGTAATTTAA 1740 

TCTTTCATTG GGCAACAGCT AAGAGGGCCC 1800 

AATTGGTTCT TGTCTTGAGT CGACTCAATT 1860 

TTTTTTCATC AAAGACACCA GGGCAGATTT 1920 

GAATTTATGC ATTTGTAAAG TTGCTGTTGA 1980 

GGTTTTGTGG GCAGTTTAAT AAACCTGAAC 2040 

GACCATCCTT TCTTTTTATA GTATATTTCT 2100 

TTCTTTTATG GGGACTTTTC GTTTTTGGGC 2160 

AGAGAATTAA TTTTTCCATT TGCATATTAA 2220 

GGCTCCGTGT GTTAAAAAAC AGCTGTATTT 2280 
AATAAACTGT GTTAATGACC 



Seq ID NO: 46 Protein sequence; 
Protein Accession #: NPJ301281 



1 11 21 31 41 51 

I I I I I I 

MSSTPHDPFY SSPFGPFYRR HTPYMVQPEY RIYEMNKRLQ SRTEDSDNLW WDAFATEFFE 
DDATLTLSFC LEDGPKRYTI GRTLIPRYFS TVFEGGVTDL YYILKHSKES YHNSSITVDC 
DQCTMVTQHG KPMFTKVCTE GRLILEFTFD DLMRIKTWHF TIRQYRELVP RSIIiAMHAQD 
PQVLDQLSKN ITRMGLTNFT LNYLRLCVTL EPMQELMSRH KTYNLSPRDC LKTCLFQKWQ 
RMVAPPAEPT RQPTTKRRKR KNSTSSTSNS SAGNNANSTG SKKKTTAANL SLSSQVPDVM 
WGEPTLMGG EFGDEDERLI TRLENTQYDA ANGMDDEEDF NNSPALGNNS PWNSKPPATQ 
ETKSENPPPQ ASQ 

Seq ID NO: 47 Nucleotide sequence: 
Nucleic Acid Accession #: NM_004126 
Coding sequence: 108.. 329 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I II I I I 

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCGAAAATG CCTGCCCTTC 120 

ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 

AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240 

AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 

AAGAAAAAGG CAGCTGTGTT ATTTC ATAAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 

AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 

TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 

GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 

ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT 

Seq ID NO: 48 Protein sequence: 
Protein Accession #: NP_004117 

1 11 21 

t I I 

MPAIiHIEDLP EKEKLKMEVE QLRKEVKLQR 

KNPFKEKGSC VIS 



60 
120 
180 
240 
300 
360 



31 41 51 

I I I 

QQVSKCSEEI KNYIEERSGE DPIiVKGIPED 60 



Seq ID NO: 49 Nucleotide sequence: 
Nucleic Acid Accession #: XM_051896 

Coding sequence: 139.. 2388 (underlined sequences correspond to start and stop codons) 
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WO 02/079492 
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1 11 21 31 41 51 

111)11 

GTTTTAAAGA CGCTAGAGTG CCAAAGAAGA CTTTGAAGTG TGAAAACATT TCCTGTAATT 60 

GAAACCAAAA TGTCATTTAT AGATCCTTAC CAGCACATTA TAGTGGAGCA CCAGTATTCC 120 

CACAAGTTTA CGGTAGTGGT GTTACGTGCC ACCAAAGTGA CAAAGGGGGC CTTTGGTGAC 180 

ATGCTTGATA CTCCAGATCC CTATGTGGAA CTTTTTATCT CTACAACCCC TGACAGCAGG 240 

AAGAGAACAA GACATTTCAA TAATGACATA AACCCTGTGT GGAATGAGAC CTTTGAATTT 300 

ATTTTGGATC CTAATCAGGA AAATGTTTTG GAGATTACGT TAATGGATGC CAATTATGTC 360 

ATGGATGAAA CTCTAGGGAC AGCAACATTT ACTGTATCTT CTATGAAGGT GGGAGAAAAG 420 

AAAGAAGTTC CTTTTATTTT CAACCAAGTC ACTGAAATGG TTCTAGAAAT GTCTCTTGAA 480 

GTTTGCTCAT GCCCAGACCT ACGATTTAGT ATGGCTCTGT GTGATCAGGA GAAGACTTTC 540 

AGACAACAGA GAAAAGAACA CATAAGGGAG AGCATGAAGA AACTCTTGGG TCCAAAGAAT 600 

AGTGAAGGAT TGCATTCTGC ACGTGATGTG CCTGTGGTAG CCATATTGGG TTCAGGTGGG 660 

GGTTTCCGAG CCATGGTGGG ATTCTCTGGT GTGATGAAGG CATTATACGA ATCAGGAATT 720 

CTGGATTGTG CTACCTACGT TGCTGGTCTT TCTGGCTCCA CCTGGTATAT GTCAACCTTG 780 

TATTCTCACC CTGATTTTCC AGAGAAAGGG CCAGAGGAGA TTAATGAAGA ACTAATGAAA 840 

AATGTTAGCC ACAATCCCCT TTTACTTCTC ACACCACAGA AAGTTAAAAG ATATGTTGAG 900 

TCTTTATGGA AGAAGAAAAG CTCTGGACAA CCTQTCACCT TTACTGATAT CTTTGGGATG 960 

TTAATAGGAG AAACACTAAT TCATAATAGA ATGAATACTA CTCTGAGCAG TTTGAAGGAA 1020 

AAAGTTAATA CTGCACAATG CCCTTTACCT CTTTTCACCT GTCTTCATGT CAAACCTGAC 1080 

GTTTCAGAGC TGATGTTTGC AGATTGGGTT GAATTTAGTC CATACGAAAT TGGCATGGCT 1140 

AAATATGGTA CTTTTATGGC TCCCGACTTA TTTGGAAGCA AATTTTTTAT GGGAACAGTC 1200 

GTTAAGAAGT ATGAAGAAAA CCCCTTGCAT TTCTTAATGG GTGTCTGGGG CAGTGCCTTT 1260 

TCCATATTGT TCAACAGAGT -TTTGGGCGTT TCTGGTTCAC AAAGCAGAGG CTCCACAATG 1320 

GAGGAAGAAT TAGAAAATAT TACCACAAAG CATATTGTGA GTAATGATAG CTCGGACAGT 1380 

GATGATGAAT CACACGAACC CAAAGGCACT GAAAATGAAG ATGCTGGAAG TGACTATCAA 1440 

AGTGATAATC AAGCAAGTTG GATTCATCGT ATGATAATGG CCTTGGTGAG TGATTCAGCT 1500 

TTATTCAATA CCAGAGAAGG ACGTGCTGGG AAGGTACACA ACTTCATGCT GGGCTTGAAT 1560 

CTCAATACAT CTTATCCACT GTCTCCTTTG AGTGACTTTG CCACACAGGA CTCCTTTGAT 1620 

GATGATGAAC TGGATGCAGC TGTAGGAGAT CCTGATGAAT TTGAGCGAAT ATATGAGCCT 1680 

CTGGATGTCA AAAGTAAAAA GATTCATGTA GTGGACAGTG GGCTCACATT TAACCTGCCG 1740 

TATCCCTTGA TACTGAGACC TCAGAGAGGG GTTGATCTCA TAATCTCCTT TGACTTTTCT 1800 

GCAAGGCCAA GTGACTCTAG TCCTCCGTTC AAGGAACTTC TACTTGCAGA AAAGTGGGCT 1860 

AAAATGAACA AGCTCCCCTT TCCAAAGATT GATCCTTATG TGTTTGATCG GGAAGGGCTG 1920 

AAGGAGTGCT ATGTCTTTAA ACCCAAGAAT CCTGATATGG AGAAAGATTG CCCAACCATC 1980 

ATCCACTTTG TTCTGGCCAA CATCAACTTC AGAAAGTACA GGGCTCCAGG TGTTCCAAGG 2040 

GAAACTGAGG AAGAGAAAGA AATCGCTGAC TTTGATATTT TTGATGACCC AGAATCACCA 2100 

TTTTCAACCT TCAATTTTCA ATATCCAAAT CAAGCATTCA AAAGACTACA TGATCTTATG 2160 

CACTTCAATA CTCTGAACAA CATTGATGTG ATAAAAGAAG CCATGGTTGA AAGCATTGAA 2220 

TATAGAAGAC AGAATCCATC TCGTTGCTCT GTTTCCCTTA GTAATGTTGA GGCAAGAAGA 2280 

TTTTTCAACA AGGAGTTTCT AAGTAAACCC AAAGCATAGT TCATGTACTG GAAATGGCAG 2340 

CAGTTTCTGA TGCTGAGGCA GTTTGCAATC CCATGACAAC TGGATTTAAA AGTACAGTAC 2400 

AGATAGTCGT ACTGATCATG AGAGACTGGC TGATACTCAA AGTTGCAGTT ACTTAGCTGC 2460 

ATGAGAATAA TACTATTATA AGTTAGGTTG ACAAATGATG TTGATTATGT AAGGATATAC 2520 

TTAGCTACAT TTTCAGTCAG TATGAACTTC CTGATACAAA TGTAGGGATA TATACTGTAT 2580 

TTTTAAACAT TTCTCACCAA CTTTCTTATG TGTGTTCTTT TTAAAAATTT TTTTTCTTTT 2640 

AAAATATTTA ACAGTTCAAT CTCAATAAGA CCTCGCATTA TGTATGAATG TTATTCACTG .2700 

ACTAGATTTA TT CAT AC CAT GAGACAACAC TATTTTTATT TATATATGCA TATATATACA 2760 
TACATGAAAT AAATACATCA ATATAAAAAT 

Seq ID NO: 50 Protein sequence! 
Protein Accession #: XP_051896 

1 11 21 31 41 51 

I I I I • ' I I 

MSFIDPYQHI IVEHQYSHKF TWVLRATKV TKGAFGDMLD TPDPYVELFI STTPDSRKRT 60 

RHFNNDINPV WNETFEFILD PNQENVLEIT LMDANYVMDE TLGTATFTVS SMKVGEKKEV 120 

PFIFNQVTEM VLEMSLEVCS CPDLRFSMAL CDQEKTFRQQ RKEHIRESMK KLIX5PKNSEG 180 

LHSARDVPW AILGSGGGFR AMVGFSGVMK ALYESGILDC ATYVAGLSGS TWYMSTLYSH 240 

PDFPEKGPEE INEELMKNVS HNPLLLLTPQ KVKRYVESLW KKKSSGQPVT FTDIFGMLIG 300 

■ ETLIHNRMNT TLSSLKEKVN TAQCPLPLFT CLHVKPDVSE LMFADWVEFS PYEIGMAKYG 360 

TFMAPDLFGS KFFMGTWKK YEENPLHFLM GVWGSAFSIL FNRVI/3VSGS QSRGSTMEEE 420 

LENITTKHIV SNDSSDSDDE SHEPKGTENE DAGSDYQSDN QASWIHRMIM ALVSDSALFN 480 

TREGRAGKVH NFMLGLNLNT SYPLSPLSDF ATQDSFDDDE LDAAVADPDE FERIYEPLDV 540 

KSKKIHWDS GLTFNLPYPL ILRPQRGVDL IISFDFSARP SDSSPPFKEL LLAEKWAKMN 600 

KLPFPKIDPY VFDREGLKEC YVFKPKNPDM EKDCPTIIHF VLANINFRKY KAPGVPRETE 660 

EEKEIADFDI FDDPESPFST FNFQYPNQAF KRLHDLMHFN TLNNIDVIKE AMVESIEYRR 720 
QNPSRCSVSL SNVEARRFFN KEFLSKPKA 



Seq ID NO: 51 Nucleotide sequence: 
Nucleic Acid Accession #: NM_006528 

Coding sequence: 57.. 764 (underlined sequences correspond to start and stop codons) 



204 



WO 02/079492 



PCT/US02/04915 



1 11 21 31 41 51 

I I I I I I 

GCCGCCAGCG GCTTTCTCGG ACGCCTTGCC CAGCGGGCCG CCCGACCCCC TGCACCATGG 60 

ACCCCGCTCG CCCCCTGGGG CTGTCGATTC TGCTGCTTTT CCTGACGGAG GCTGCACTGG 120 

GCGATGCTGC TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCTCCTG CCCCTAGACT 180 

ACGGACCCTG CCGGGCCCTA CTTCTCCGTT ACTACTACGA CAGGTACACG CAGAGCTGCC 240 

GCCAGTTCCT GTACGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGAGGCTT 300 

GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCCGGCTG CAAGTGAGTG 360 

TGGACGACCA GTGTGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

GTGAAAAATT CTTTTCCGGT GGGTGTCACC GGAACCGGAT TGAGAACAGG TTTCCAGATG 480 

AAGCTACTTG TATGGGCTTC TGCGCACCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 54 0 

AAGATGAGGG ACTGTGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600 

CCTGTGATGC TTTCACCTAT ACTGGCTGTG GAGGGAATGA CAATAACTTT GTTAGCAGGG 660 

AGGATTGCAA ACGTGCATGT GCAAAAGCTT TGAAAAAGAA AAAGAAGATG CCAAAGCTTC 720 

GCTTTGCCAG TAGAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC TTAATATGTC 780 

ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGGTTGT ATCTGAAGAA TAATATGACA 840 

GCATGAGGAA ACAAATCATT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 900 

TTCAAAAATT TGGATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA GTCTACCATT 960 

TTTAATTTAT GGTTCAACTG TTTGTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC 1020 

AAATATGACT CACTCATTTC TTGGGGTCGT ATTCCTGATT TCAGAAGAGG ATCATAACTG 1080 

AAACAACATA AGACAATATA ATCATGTGCT TTTAACATAT TTGAGAATAA AAAGGACTAG 1140 
CC 

Seq ID NO: 52 Protein sequence: 
Protein Accession #: NP_006519 

1 11 21 31 41 51 

I I I 1 I I 

MDPARPLGLS ILLLFLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS 60 

CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNIiSSM 120 

TCEKFFSGGC HRNRIENRFP. DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 180 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 



Seq ID NO: 53 Nucleotide sequence; 
Nucleic Acid Accession #: AA478778 
Coding sequence: no ORF found 



1 11 21 31 41 51 

I I I I I I 

TATTTTTGTA CGTAAAATGA TTCTATTATG ACTGCCTTTG CATGTAGTAA TATGACAAAG 60 

TGATCCTTCA TTATCACGGT ACACTATTGT TTACTTTTCA TCTGTAAATG TTTTATTGTT 120 

ACTTTTTTAA AATGAATTTT TTTAAAACAA TCTAGCCATC ATCAAGGTGC TATAAGAGTT 180 

GTATAAAAGA TATTTTTGGC ATTTCTAGGC AAGTATCAGC CAATAAGTAT GTTAGTGATA 240 

TCACAGATTG TACCAACTAT TAACTATGTT AAATAAGTAT TCAGTTTCAT GTGATCTCTG 300 

GGAAAAAAAT ATGCTGCCTT GGTGCTAATA TTGTATGTAT TTAAATGATC ATCTGACTCA 360 

GAAATATAAA CACTTTTAAT GAAAGGGAGG AACGGAAGGA CAATTTCCAG TGCACAGAAT 420 

CACTTGGATG AAATAAGACC AGCTCTTTAC CCTTATTTTT GGATATGCCT TTTTTGGAAG 4 80 

. AGACTTAGAC TTTATCCTTA TTGTTGTTAG TGTTGTTAAT ATTCGTTGCT TCAGCCCACG 540 

. GTGCCTTGGT CTCTCCACAA TCAAATGGAG GATCCCCCAA GCAGCTTCAT TACAGAGTGA 600 

TATTGGGAAA GTGAGATCCT CTCACCATTT TGCCAAGATA CTCTAAAATG ACATCCAAGT 660 

TTACCAGTAG AAAGACACAG GATGCACAGA ATGGGCATGA CCTTCAGCTC ACGAGCACAC 720 

CTGGAGAAAT TCAGAACCAG GTTCTGAATC ATCACGATTG CCTTTTGCAT GAAAACATCG 780 

' GCTGGTGATG TGACTTCTCT TCAGGCCATG AGCCTAACAY CCTGCCGGTT TTCATGCCCG 840 

CTGCAGTAAT GGACGTTTGT GTGAAGAAAT GAACTGTGGA GTACAAAATG CTTTGAGTCT 900 

TTCCGATTGC TCATTAATTC ACTTTTTTGT TACTTCTTTC CAAAATGGAA GTGCTGAAGC 960 

CATGGTCTTT CTGCCCCTCC AAGCTGATGA AGGGAAGCCT TTGCCAATGG CCCATGGAAG 102 0 

ACACTTGGTT TGAGAAACCC TGCCCACTTC CAAAGACCAA AGAGATTAGG AAAAGCCTGG 1080 

CAGTATTCTC CAACTCCAAA CAAGCTCTAG AGTGCTCCAG GAAAAGTTAT ATTCAGTATA 1140 

TGAATAAGTG TTATTCTCCA TTATTAATGT GTTCTGAAAA TATATTATGA ATAAATACAT 1200 
CACCACACCC AAAAAAAAAA AAAAAAAAAA AAAA 



Seq ID NO: 54 Nucleotide sequence: 
Nucleic Acid Accession #: NM_020663 

Coding sequence: 1..645 (underlined sequences correspond to staxt and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGAACTGCA AAGAGGGAAC TGACAGCAGC TGCGGCTGCA GGGGCAACGA CGAGAAGAAG 60 

ATGTTGAAGT GTGTGGTGGT GGGGGACGGT GCCGTGGGGA AAACCTGCCT GCTGATGAGC 120 

TACGCCAACG ACGCCTTCCC AGAGGAATAC GTGCCCACTG TGTTTGACCA CTATGCAGTT 180 

ACTGTGACTG TGGGAGGCAA GCAACACTTG CTCGG ACTGT ATGACACCGC GGGACAGGAG 240 

GACTACAACC AGCTGAGGCC ACTCTCCTAC CCCAACACGG ATGTGTTTTT GATCTGCTTC 300 
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TCTGTCGTAA ACCCTGCCTC TTACCACAAT GTCCAGGAGG AATGGGTCCC CGAGCTCAAG 360 

GACTGCATGC CTCACGTGCC TTATGTCCTC ATAGGGACCC AGATTGATCT CCGTGATGAC 420 

CCAAAAACCT TGGCCCGTTT GCTGTATATG AAAGAGAAAC CTCTCACTTA CGAGCATGGT 480 

GTGAAGCTCG CAAAAGCGAT CGGAGCACAG TGCTACTTGG AATGTTCAGC TCTGACTCAG 540 

AAAGGTCTCA AAGCGGTTTT TGATGAAGCA ATCCTCACCA TTTTCCACCC CAAGAAAAAG 600 
AAGAAACGCT GTTCTGAGGG TCACAGCTGC TGTTCAATTA TC TGA 



Seq ID NO: 55 Protein sequence: 
Protein Accession #: NP_065714 

1 11 21 31 41 51 

I I I I I I 

MNCKEGTDSS CGCRGNDEKK MLKCVWGDG AVGKTCLLMS YANDAFPEEY VPTVFDHYAV 60 
TVTVGGKQHL LGLYDTAGQE DYNQLRPLSY PNTDVFLICF SWNPASYHN VQEEWVPELK 120 
DCMPHVPYVL IGTQIDIoRDD PKTLARLLYM KEKPLTYEHG VKIAKAIGAQ CYLECSALTQ 180 
KGLKAVFDEA ILTIFHPKKK KKRCSEGHSC CSII 



Seq ID NO: 56 Nucleotide sequence: 

Nucleic Acid Accession #: fgenesh prediction 

Coding sequence: 1-546 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATG GCCTTGG GCAGCTCCGC CCCTGTGGCT TTGCAGGGTA ATGCCCACTT CCCTGCTGCT 60 

TTCATGGCTG GCATTAAGTG TCTGTGGCTT TTCCAGGTAG TCCCCCTGGG GCTCCCCGAG 120 

TTGGTGCAAA GGCTCCTGGG TGGAGCTCGA ACTGAAACTC GCTTTGTGCC CGCAGCCCTG 180 

CAGCTCGCCG GTGCCCTCGA CCTGCCCGCT GGGTCCTGTG CCTTTGAAGA GAGCACTTGC 240 

GGCTTTGACT CCGTGTTGGC CTCTCTGCCG TGGATTTTAA ATGAGGAAGG CCAGCAACCT 300 

TTCTGGTCCT CAGGAGACAT GTCTGACTGG GACTACTGGG TTGGCTGGCG GAAGTTAATT 360 

CATTCTCCTC TGAGCACTCC AGGGTGGAGC AGGCAGGTTA GGCTCCAGTT GTTCCAGCTT 420 

CAGTTTGTCA AAGGCCAGAA CTTGGACGTA ACAGTGTACT GCAGGCTCCA GGGCAGTGAG 480 

AAACCCTTTG AAACTGGTTC CATGGTTCCA TTCACCTCCA TGTACTGGAT CCACCATGGA 540 
AAGTAG 



Seq ID NO: 57 Protein sequence : 

Protein Accession #: fgenesh prediction 



1 11 21 31 41 51 

I • I I I I I 

MALGSSAPVA LQGNAHFPAA FMAGIKCLWL FQWPLGLPE LVQRLLGGAR TETRFVPAAL 
QLAGALDLPA GSCAFEESTC GFDSVLASLP WILNEEGQQP FWSSGDMSDW DYWVGWRKLI 
HSPLSTPGWS RQVRLQLFQL QFVKGQNLDV TVYCRLQGSE KPFETGSMVP FTFMYWIHHG 
K 



60 
120 
180 



Seq ID NO: 58 Nucleotide sequence: 
Nucleic Acid Accession #: XM_050478 

Coding sequence: 27.. 4508 (underlined sequences correspond to start and stop codons) 
1 ~ 11 21 31 41 51 

I I II . 'I I 

CCGGCGGCGC CTGAGCCCAG CCGAGGATGG AGAACCGGCC TGGGTCCTTC CAGTACGTCC 60 

CTGTGCAGCT GCAAGGGGGG GCACCCTGGG GCTTCACCCT TAAGGGGGGT CTGGAACACT 120 

GTGAGCCGCT CACAGTGTCT AAGATTGAAG ATGGAGGCAA GGCAGCTTTG TCCCAGAAGA 180 

TGAGGACTGG TGATGAGCTG GTGAATATCA ATGGCACTCC ATTATATGGC TCCCGCCAAG 240 

AGGCCCTCAT TCTCATCAAA GGCTCCTTCC GGATTCTCAA GCTGATTGTC AGGAGGAGGA 300 

ACGCCCCTGT CAGTAGGCCG CACTCATGGC ATGTGGCCAA GCTGCTGGAG GGATGCCCTG 360 

AAGCAGCCAC CACCATGCAT TTCCCTTCTG AAGCCTTCAG CTTGTCCTGG CATTCTGGCT 420 

GCAACACAAG TGACGTGTGT GTGCAGTGGT GTCCACTCTC CCGGCATTGC AGCACCGAGA 480 

AAAGCAGCTC CATTGGCAGC ATGGAGAGCC TGGAGCAACC AGGCCAAGCC ACCTATGAGA 540 

GCCATCTGTT GCCTATTGAC CAGAACATGT ACCCTAACCA GCGTGACTCA GCCTACAGCT . 600 

CCTTCTCGGC CAGCTCAAAT GCTTCTGACT GTGCCCTTTC CCTCAGGCCA GAGGAGCCAG 660 

CCTCTACAGA CTGCATCATG CAAGGCCCAG GGCCAACTAA GGCCCCCAGT GGCCGGCCTA 720 

ATGTGGCTGA GACCTCAGGA GGTAGTCGGC GCACCAATGG GGGCCACCTG ACCCCCAGCT 780 

CTCAGATGTC ATCCCGTCCA CAGGAGGGAT ACCAGTCAGG GCCCGCCAAA GCAGTCAGGG 840 

GCCCACCACA ACCTCCAGTG AGGCGGGACA GCCTTCAGGC CTCCAGAGCC CAACTCCTCA 900 

ATGGAGAGCA GCGCAGGGCA TCTGAGCCTG TGGTCCCCTT GCCACAGAAG GAGAAACTGA 960 

GCTTAGAGCC TGTGCTACCC GCAAGGAACC CTAATAGGTT CTGTTGCCTC AGTGGGCATG 1020 

ACCAAGTGAC AAGTGAGGGC CATCAGAACT GTGAGTTCAG TCAGCCTCCT GAATCCAGCC 1080 

AACAGGGCTC TGAGCATCTA CTGATGCAGG CCTCAACCAA AGCTGTTGGA TCCCCAAAAG 1140 

CCTGTGACAG AGCTTCCAGC GTGGATTCCA ACCCACTCAA TGAGGCTTCT GCAGAGCTAG 1200 

CTAAGGCTTC TTTTGGCAGA CCTCCACATC TCATAGGACC CACAGGGCAT CGCCATAGTG 1260 

CCCCTGAACA GCTGCTGGCA TCCCACCTGC AGCATGTGCA CCTTGATACC AGGGGCAGCA 1320 

AAGGGATGGA GCTCCCACCC GTACAGGATG GGCACCAGTG GACTCTGTCC CCTTTGCACA 1380 
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GCAGCCACAA AGGGAAGAAA AGTCCATGCC CCCCTACAGG AGGAACCCAT GACCAGTCCA 1440 

GCAAAGAAAG AAAGACCAGA CAAGTGGATG ACAGGTCTTT AGTTTTGGGA CACCAGAGCC 1500 

AAAGCAGTCC CCCACATGGA GAGGCTGATG GACACCCCTC AGAAAAAGGT TTCCTGGACC 1560 

CAAACAGAAC AAGCAGAGCA GCCAGTGAAT TGGCCAACCA GCAACCCTCT GCCTCTGGCT 1620 

CCCTTGTTCA ACAAGCCACG GACTGTTCTT CAACCACTAA AGCAGCTAGT GGCACAGAGG 1680 

CAGGTGAAGA AGGGGACAGC GAGCCCAAGG AGTGCAGCCG GATGGGTGGT AGGCGAAGTG 1740 

GAGGGACCCG GGGCCGCTCG ATCCAAAACC GGCGGAAGAG TGAGCGTTTT GCTACCAATC 1800 

TGCGTAATGA AATTCAGAGG AGGAAGGCCC AGCTCCAGAA AAGCAAGGGT CCCTTGTCAC 1860 

AGCTGTGTGA CACTAAGGAG CCAGTGGAAG AGACCCAGGA GCCCCCAGAA AGTCCTCCAC 1920 

TCACTGCCTC TAACACATCT CTTCTATCTT CATGTAAAAA ACCTCCCAGC CCCAGAGACA 1980 

AGCTCTTCAA CAAAAGCATG ATGCTCAGAG CTAGGTCTTC CGAGTGCCTC AGCCAAGCCC 2040 

CTGAGAGCCA TGAATCTAGG ACAGGCTTAG AGGGACGAAT AAGCCCTGGC CAGAGGCCTG 2100 

GCCAGTCCTC TTTGGGCCTG AACACCTGGT GGAAAGCACC TGACCCATCC TCCTCAGACC 2160 

CTGAGAAAGC ACATGCTCAC TGTGGAGTCC GTGGAGGTCA TTGGAGATGG TCTCCAGAGC 2220 

ATAATTCACA GCCACTTGTG GCAGCAGCCA TGGAAGGCCC TTCCAACCCA GGTGACAACA 2280 

AGGAATTGAA GGCTTCTACT GCTCAAGCTG GGGAGGATGC CATCCTCTTG CCTTTTGCAG 2340 

ACAGAAGAAA GTTCTTTGAA GAGAGTAGCA AATCCTTATC TACATCTCAT TTGCCAGGTT 2400 

TAACCACTCA TAGCAACAAG ACTTTTACCC AGAGACCAAA ACCTATAGAC CAAAACTTCC 2460 

AGCCAATGAG CTCCAGCTGT AGGGAATTGA GGCGCCATCC CATGGACCAA TCATATCATT 2520 

CCGCAGACCA ACCATATCAT GCCACAGACC AATCATATCA TTCCATGTCA CCCCTTCAGT 2580 

CAGAAACTCC CACTTACTCA GAATGTTTTG CAAGCAAAGG TCTAGAAAAT TCCATGTGTT 2640 

GTAAGCCACT ACACTGTGGT GATTTTGATT ACCACAGGAC CTGCTCTTAC TCCTGCAGTG 2700 

TTCAAGGAGC TCTAGTCCAT GATCCTTGCA TTTATTGTTC TGGGGAAATC TGCCCTGCCT 2760 

TGCTAAAGAG AAATATGATG CCAAATTGCT ACAACTGCCG GTGCCACCAC CACCAATGCA 2820 

TTCGGTGTTC AGTTTGCTAT CATAATCCTC AGCACAGTGC CCTCGAGGAC AGCAGCTTGG 2880 

CACCTGGCAA CACTTGGAAA CCCAGGAAGC TGACAGTGCA GGAATTTCCT GGGGACAAAT 2940 

GGAATCCAAT AACAGGAAAC AGGAAGACCA GCCAGTCAGG GAGGGAAATG GCTCATTCCA 3000 

AGACTAGCTT TTCATGGGCA ACCCCTTTCC ATCCTTGCCT TGAGAACCCA GCACTGGACT 3060 

TGTCAAGCTA CCGAGCAATT TCTTCTCTTG ACCTCCTTGG AGACTTCAAA CATGCTTTGA 3120 

AAAAATCAGA GGAAACTTCA GTTTATGAGG AGGGGAGCTC CCTTGCCTCC ATGCCCCACC 3180 

CACTGCGCAG CCGTGCCTTC TCAGAGAGTC ACATCAGCTT GGCGCCCCAA AGCACCCGGG 3240 

CCTGGGGGCA GCATAGGAGG GAGCTCTTTA GCAAAGGTGA TGAGACCCAG TCGGATCTTC 3300 

TCGGAGCCAG GAAGAAGGCC TTTCCTCCTC CTCGCCCTCC TCCTCCCAAC TGGGAGAAGT 3360 

ACAGGCTCTT TCGTGCAGCC CAGCAGCAGA AGCAGCAACA GCAGCAGCAG AAGCAACAGG 3420 

AGGAGGAGGA GGAGGAGGAA GAAGAAGAAG AAGAGGAAGA GGAAGAGGAG GAGGAGGAGG 3480 

CAGAGGAGGA GGAAGAGGAG CTGCCACCCC AGTATTTCAG TTCAGAAACC TCTGGTTCCT 3540 

GTGCTCTCAA TCCTGAGGAG GTCCTAGAGC AGCCACAACC CCTCAGCTTT GGCCACCTGG 3600 

AGGGCTCGAG ACAGGGTTCA CAAAGTGTCC CAGCAGAGCA AGAATCCTTT GCACTCCATT 3660 

CCAGTGATTT CTTGCCTCCA ATAAGGGGTC ACTTGGGATC TCAACCTGAG CAGGCTCAGC 3720 

CCCCTTGCTA CTATGGCATT GGTGGGCTTT GGAGGACATC GGGACAGGAA GCCACTGAAT 3780 

CCGCCAAACA AGAGTTTCAG CACTTTTCGC CTCCTTCAGG GGCCCCAGGA ATCCCTACCT 3840 

CTTACTCAGC TTATTACAAT ATTTCTGTGG CCAAGGCAGA GCTGCTGAAC AAACTGAAAG 3900 

ACCAACCTGA GATGGCAGAG ATTGGCCTAG GAGAGGAGGA AGTTGACCAT GAACTGGCTC 3960 

AAAAAAAGAT ACAGCTTATC GAAAGCATCA GCAGAAAACT TTCTGTCTTG CGGGAGGCCC -4020 

AGCGAGGGCT GCTAGAGGAC ATCAATGCCA ATTCTGCCCT TGGGGAGGAG GTGGAGGCCA 4080 

ACTTAAAAGC CGTCTGCAAA TCCAATGAAT TTGAAAAGTA CCACTTGTTT GTTGGGGACC 4140 

TGGACAAAGT GGTCAACCTG TTGCTGTCAC TCTCTGGACG ACTGGCCCGG GTGGAGAATG 4200 

CTCTGAACAG CATCGATTCA GAGGCCAACC AGGAGAAGTT GGTACTGATA GAGAAGAAGC 4260 

AGCAGCTGAC GGGGCAGTTG GCAGATGCCA AGGAGCTGAA GGAGCACGTG GACCGCCGGG 4320 

AGAAGTTGGT GTTTGGCATG GTCTCCCGCT ACCTGCCTCA GGACCAGCTC CAAGATTACC 4380 

AGCACTTTGT CAAGATGAAA TCTGCTCTCA TCATTGAACA GCGAGAGCTG GAGGAGAAGA 4440 

TCAAGCTCGG GGAAGAGCAA CTCAAATGTC TCAGGGAGAG TCTACTCCTG GGGCCCAGCA 4500 

ATTTCTAATT CTACCAGCAC TCTG CCACAG CATCCCTGCC CAGCCATGTG GGAAGTGCTT 4560 

TCAATCTTCT TTGTTAGCAG TTTCTCAGCA AGTAGATAGC AATTAGCAGT TTGTTCCAGC 4620 

CCTCTACCCT GGATGTCTCT CACTACCCCT TCCCTAGCAG TGGTCCTAAC CAG CTAGGAG 4680 

ACCCTGGGGA AGCCACAAGC TTCTACCCAA GGGAGCTGCA GCAAGGTGTG ATCTTAGAAC 4740 

CACACTCTCC TTCCCACAGT TGCCAAGGGC AAGTACTTGC TGCACAGAGA ACCAAGGAAG 4800 

TGCCTTCATT CTGCTTTGTA CTAGGACACC AAAGACATCA AGTACTCATC ACCCACCCAT 4 860 

ATCATCAACA GCCTCTAAAG GCTCAGAGGG AATCTGCCTT GCAGCTCTAC TCTGCCCCAG 4920 

GGCTTGTGGC CAGCCATTTC TCACAGAGAG CTGGCTGCCT TGAGGGCATT CACCTGGCAC 4980 

CAGTTTCAGG GCCTCACCCA AGCTTTGCAG GGGAAAGCAC AGAGGGAGGA ATTACACTGA 5040 

AAAAAATGCA AGCAAAGGTT GAGTACCCCC AGGTGCCCCT TAGGAAGGAA CCAGGTTTAA 5100 

ATAGGCTCTA CCCTTACCTT TCCCAGCAGC AAGTTCAGGG GAAGAGGCCT ACTCTTAGCC 5160 

CTGGCTAGTG TGACCCTCTT CCTGTCCTAA GACTTTGGTC CTACCACCTC TTGTTTCATC 5220 

TTTCCTTTAC ATTGCTGGGG GTTACCGCAG GTGCCTACCC CAGGGCTTCA CCATATGGGC 5280 

CATTAATAGC TCTACTAAAA CTGACTTCTA GATGTAGGTT TCATTATTGG GGGAGGGGGT 5340 

TCTTATTGTT ATATTTTAAA TGGCCTTTTG ATTTTATTTA TTTTTATGTT TTGATTATTT 5400 

TTTTCTTTTT TAACTAATAA GGCGAGAAGA GGGAAGTTGG AGAGGGAAAA GTTAGCCCAG 5460 

AAGGAAAGCA TTTTCTGCAG ATCAGCCTGA ATCCACCGTG GCTAGGCATA TTCTTGCTCT 5520 

TCTCGTGTTG CTCACAACTA CCTGCCTGGA TGAATTTAGG AAAGTTGCAG GATACAAGGT 5580 

TAAAACACAA GATCAAATGA ACAATCCGAA AATGTTATTA AGAAAACAGT TCCGGCCGGG 5640 

CATGGTGGCT CACGCCTGAA ATCCCAGCAC TTTGGGAGGC CGAGGCAGGT GGATCACGAG 5700 

GTCAGGAGAT CAAGACCATC CTGGCTAACA CGGTGAAACC CTATCTCTAC TAAAAATACA 5760 

AAAAATTAGC CAGGTGTGGT GGCACGCACC AGTAGTCCCA GCTACTCGGG AGGCTGAGGC 5820 

AGGAGAATTG CTTGAACCTG GAAGGCAGAG ATTGCAGTGA GCTGAGACCA CACCACTGCA 5880 

CTCCATCCTG GGCAACAGAG TGAGACTTTG TCTCAAAAAG AAAGAAAGAA AGAAAGAAAG 5940 
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AAAGAAAGAA AGAAAAGAAA GAAAGAAAGA AAGAAAGAAA ACAGTTCCAT TTACAATAGC 6000 
ATC 

Seq ID NO: 59 Protein sequence:, 
Protein Accession XPJ)50478 

1 11 21 31 41 51 

I I I I I I 

MENRPGSFQY VPVQLQGGAP WGFTLKGGLE HCEPLTVSKI EDGGKAALSQ KMRTGDELVN 60 

INGTPLYGSR QEALILIKGS FRILKLIVRR RNAPVSRPHS WHVAKLLEGC PEAATTMHFP 120 

SEAFSLSWHS GCNTSDVCVQ WCPLSRHCST EKSSSIGSME SLEQPGQATY ESHLLPIDQN 180 

MYPNQRDSAY SSFSASSNAS DCALSLRPEE PASTDCIMQG PGPTKAPSGR PNVABTSGGS 240 

RRTNGGHLTP SSQMSSRPQE GYQSGPAKAV RGPPQPPVRR DSLQASRAQL LNGEQRRASE 300 

PWPLPQKEK LSLEPVLPAR NPNRFCCLSG HDQVTSEGHQ NCEFSQPPES SQQGSEHLLM 360 

QASTKAVGSP KACDRASSVD SNPLNEASAE LAKASFGRPP HLIGPTGHRH SAPEQLLASH 420 

LQHVHIiDTRG SKGMELPPVQ DGHQWTLSPL HSSHKGKKSP CPPTGGTHDQ SSKERKTRQV 480 

DDRSLVLGHQ SQSSPPHGEA DGHPSEKGFL DPNRTSRAAS ELANQQPSAS GSLVQQATDC 540 

SSTTKAASGT EAGEEGDSEP KECSRMGGRR SGGTRGRSIQ NRRKSERFAT NLRNEIQRRK 600 

AQLQKSKGPL SQLCDTKEPV EETQEPPESP PLTASNTSLL SSCKKPPSPR DKLFNKSMML 660 

RARSSECLSQ APESHESRTG LEGRISPGQR PGQSSLGLNT WWKAPDPSSS DPEKAHAHCG 720 

VRGGHWRWSP EHNSQPLVAA AMEGPSNPGD NKELKASTAQ AGEDAILLPF ADRRKFFEES 780 

SKSLSTSHLP GLTTHSNKTF TQRPKPIDQN FQPMSSSCRE LRRHPMDQSY HSADQPYHAT 840 

DQSYHSMSPL QSETPTYSEC FASKGLENSM CCKPLHCGDF DYHRTCSYSC SVQGALVHDP 900 

CIYCSGEICP ALLKRNMMPN CYNCRCHHHQ CIRCSVCYHN PQHSALEDSS LAPGNTWKPR 960 

KLTVQEFPGD KWNPITGNRK TSQSGREMAH SKTSFSWATP FHPCLENPAL DLSSYRAISS 1020 

LDLLGDFKHA LKKSEETSVY EEGSSXiASMP HPIiRSRAFSE SHISliAPQST RAWGQHRREL 1080 

FSKGDETQSD LLGARKKAFP PPRPPPPNWE KYRLFRAAQQ QKQQQQQQKQ QEEEEEEEEE 1140 

EEEEEEEEEE EAEEEEEELP PQYFSSETSG SCALNPEEVL EQPQPLSFGH LEGSRQGSQS 1200 

VPAEQESFAL HSSDFLPPIR GHLGSQPEQA QPPCYYGIGG LWRTSGQEAT ESAKQEFQHF 1260 

SPPSGAPGIP TSYSAYYNIS VAKAELLNKL KDQPEMAEIG LGEEEVDHEL AQKKIQLIES 1320 

ISRKLSVLRE AQRGLLEDIN AMSALGEEVE ANLKAVCKSN EFEKYHLFVG DLDKWNLLIi 1380 

SLSGRLARVE NALNSIDSEA NQEKLVLIEK KQQLTGQLAD AKELKEHVDR REKLVFGMVS 1440 
RYLPQDQLQD YQHFVKMKSA LIIEQRELEE KIKLGEEQLK CLRESLLLGP SNF 



Seq ID NO: 60 Nucleotide sequence; 
Nucleic Acid Accession #: NM_014705 

Coding sequence: 192.. 2489 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

] I I I I 1 

GGGAGAAGCT AGGAAAAAAT GTCTTTGAGC TGTGAGATGC TTGTATATTT TGAAAATATG 60 

ATTATATGCA TGTGTTTGTA TTTTATGACT TGGATAATCT GAAAATCAAT TTGCTTTGTC 120 

AATGCTTCCT GGATTAGAAT TCCACTATTT GGTCCCTATC CTAGTCTACT AAAGAAAATT 180 

GAGCGGGAAA CATGGCGGGA AAGTGGCGTT TCATTAATTG CTACTGTAAC TCGTCTAATG 240 

GAGAGGTTGT TAGATTACAG AACTTCTATA AGACTGAACT GAACAAGGAG GAGATGTATA 300 

TACGCTACAT TCACAAACTC TATGATCTGC ATCTCAAAGC ACAGAACTTT ACAGAAGCTG 360 

CATATACCCT CCTCTTATAT GACGAGCTAC TGGAATGGTC TGATCGGCCC CTCAGGGAGT 420 

TCCTGACCTA CCCCATGCAA ACAGAATGGC AGCGCAAAGA GCACCTGCAC CTCACCATCA 480 
TCCAGAACTT TGACAGAGGC AAATGTTGGG AGAATGGCAT TATCTTGTGC CGGAAGATTG • ■• 540 

CAGAGCAGTA TGAGAGTTAT TATGACTACA GAAACCTGAG CAAGATGCGG ATGATGGAAG 600 

CCTCTTTGTA TGACAAAATT ATGGACCAGC AACGTCTTGA ACCAGAGTTC TTCAGAGTTG 660 

GATTTTATGG AAAAAAATTT CCATTTTTCT TAAGAAATAA GGAGTTTGTG TGTCGAGGGC 720 

ATGACTACGA GAGGCTGGAA GCCTTCCAAC AGAGAATGCT GAACGAGTTC CCCCATGCCA 780 

TCGCCATGCA GCACGCCAAC CAGCCCGATG AGACCATCTT CCAGGCAGAA GCTCAGTATT 840 

TGCAGATATA TGCTGTGACT CCCATTCCAG AGAGCCAGGA GGTCCTGCAG AGAGAGGGTG 900 

TTCCGGACAA CATCAAAAGC TTCTATAAAG TGAATCACAT CTGGAAATTC CGCTATGACC 960 

GACCATTTCA CAAAGGCACA . AAAGAT AAAG AGAATGAATT CAAGAGTCTC TGGGTGGAGA 1020 

GAACGTCATT ATACTTGGTG CAGAGTTTGC CTGGCATCTC TCGCTGGTTT GAAGTGGAAA 1080 

AGCGTGAAGT GGTAGAAATG AGTCCTCTGG AAAATGCAAT TGAAGTGCTA GAAAATAAGA 1140 

ATCAGCAGCT GAAGACTCTG ATTAGTCAGT GTCAGACAAG ACAGATGCAG AATATTAATC 1200 

CCCTGACTAT GTGCCTGAAT GGAGTTATAG ATGCTGCAGT TAATGGTGGC GTTTCCAGGT 1260 

ATCAAGAGGC ATTCTTTGTC AAAGAATATA TCTTAAGTCA CCCTGAAGAT GGGGAGAAAA 1320 

TTGCACGATT AAGAGAGCTG ATGCTTGAGC AGGCACAGAT TCTGGAATTT GGTTTGGCCG 1380 

TGCATGAGAA GTTTGTACCT CAAGATATGA GACCCCTTCA CAAAAAGCTG GTTGACCAAT 1440 

TCTTTGTGAT GAAGTCGAGC TTAGGGATAC AGGAGTTCTC TGCTTGTATG CAAGCCAGTC 1500 

CTGTCCATTT TCCTAATGGA AGCCCTCGTG TGTGTAGAAA CTCAGCACCT GCTTCTGTGA 1560 

GCCCAGATGG TACCAGGGTA ATTCCTAGAC GCAGCCCGTT AAGTTACCCA GCTGTCAACC 1620 

GATATTCTTC CTCCTCACTG TCCTCACAAG CTTCTGCTGA AGTAAGCAAT ATTACAGGGC 1680 

AATCAGAAAG CTCTGATGAA GTCTTTAACA TGCAGCCAAG TCCATCTACC TCAAGCTTGA 1740 

GTTCTACTCA CTCGGCTTCA CCTAATGTGA CAAGTTCTGC TCCATCGAGT GCCAGAGCTT 1800 

CTCCTTTGTT GTCTGACAAA CACAAACATT CCCGAGAAAA CTCTTGCCTG TCACCAAGAG 1860 

AGAGACCATG CAGTGCCATC TATCCAACAC CTGTGGAGCC TTCGCAGAGG ATGCTGTTTA 1920 

ATCATATTGG AGACGGGGCC TTGCCACGCA GTGACCCAAA TCTCTCTGCA CCTGAAAAAG 1980 

CTTCACCAGC AAGACACACG ACATCAGTAT CCCCCTCGCC TGCCGGGCGA TCTCCATTGA 2040 
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AGGGCTCTGT GCAGTCTTTC ACCCCCTCTC CAGTGGAGTA 
CCAACTCCCC TGTCTTGTCG GGCAGCTACA GCAGTGGGAT 
GCACGTCGGA AACCTCAGGC TTTGAAAATC AGGTGAATGA 
TGCCAGTGCC GGTGCCCGTG CCGAGCTACG GCGGGGAGGA 
AGACTCCGCC CCCGTACAGC GTCTACGAGC GGACTCTGCG 
ACAGCCTCTC CATCCCCGTC ACGTCGGAGC CGCCCGCGCT 
CGCGATCCAG CCACCTGGAG AATGGGGCCC GGAGGACTGA 
CCCTGCCCCG CAAGGTCTCT CAGTTATAAG TCACTTTTCT 
TTGCCCGTTT ACAAAATAAG AAGTATGATG AGAAGACATT 
CTTACTCAGC TCCTTCGATG AATGGAATTA AAACTTGCTT 
ATATTAAAAG TTGCTGATCT AAAACGCCAG ATGTTAAATG 
AAAACGTTTC TCATTTGGAA GTGGTAAATA GTGATAAAGA 
GTTCACTTTT TTTTATATAG TTTAATCTTA AAACCAATAC 
TGTGTGACAA TGTTGTATCG TTTTTACTGA ATACTTGATA 
TCAGTGCACA TCCTAACACA GTGGTCCTTA TTTTAGAAGA 
TTTATCAGTG CAGATCATCA GAATTAAAGT TCAAGCAGGC 
GGGGTTGCAA AGCTTGGGAC TGGAAATTGT TTTGTTCTTG 
GTTGCTTTTG CTGTTTGACT GCTGTCTACA TTCGTAAAAT 
CTAAATCCCT TACTACCCTG ACACCGTGGT ATCTACTGTA 
TGCTTCAGAG TTCCAATCAG CTAGATTAAG CAAGAGGCTC 
ATTTTGCGCT TCCTTTCTTG ATAGTTTCCT ATATAAAATT 
TGCTGAAGTA TTAATGAGGC ACAAATGACT GTGCCCCATT 
TACAGACAGT ATTAAATTAA TAGCTTAAGT GAAGAAAAAA 
TTAGCACGAT TAAATGGCAA AAGGACTTAT AAAAGGCAAG 
CACAAAATAA AAAATTCCTC ACGACTCTCC ACTTTTACCA 
CCTGTCGTCT TTCTCTTGAA GGAGGATTGC TGTAGACTTC 
ATAGCATCTT AGGTCTAGAT AGGGATGCTA ATGCCAGTTG 
CTTGTATGTA GTAATGTATT TTATATCTTT GTTTTTTCTT 
TCAATTGACA ATAGATATGA ACTGTATTTT AAATCATACT 
TGTTGGGAAG CTCATTTTAG TTTAACCATG TTTGTTTTGT 
AGTGACCACT TTTTTATATT CTCTTAATGA AACCATTCAG 
TGGTTATAGA GGTTTTCTAT AATAAATGTT CAAGTATTTT 
TAATAAGAGA TACCATTATG TGTAAAAAAA AGTAAAAATA 
CAGTATGATT GTTATAATTA TGCCAAATAC TTTACGTATG 
ATGTGCTTTT AACAATTCTG CCATATTGAC TTTACAATTT 
ATATGTTAAA TATTTATGTT TAGTGAAAGT GTTCATAATT 
TTTAGCTTTG TATCTTGCAA GTTTTGCAGT CAGAAATTTT 
GATAACACTT CGTGTTTGTA ACCACATTCA TATATATATA 
TATTTCTGTT GCTTTAAAGA AGTAAAACCT TCCATTTAAA 
AACAAAGCTT CCTTGATTTC CTTTTCCTGT GTAATTTAAT 
GGGCACATTA TAAATCAGTG TTATTTGCTC TTGGAGCCAT 
AGTGAGCAGT TGAATTTATC TTGAATTTAT CATGTGTGTG 
AGCAGAACAT TTTAAGAGAT TCTGTTAGCC CACATGTTCA 
TAAATATTAA ATAAAATTAC CAGATTAATC TT 

Seq ID NO: 61 Protein sequence: 
Protein Accession #: NP_055520 

1 11 21 31 

I I I I 

MAGKWRFINC YCNSSNGEW RLQNFYKTEL NKEEMYIRYI 
LLYDELLEWS DRPLREFLTY PMQTEWQRKE HLHLTIIQNF 
ESYYDYRNLS KMRMMEASLY DKIMDQQRLE PEFFRVGFYG 
RLEAFQQRML NEFPHAIAMQ HANQPDETIF QAEAQYLQIY 
IKSFYKVNHI WKFRYDRPFH KGTKDKENEF KSLWVERTSL, 
VEMSPLENAI EVLENKNQQIj KTLISQCQTR QMQNINPLTM 
FFVKEYILSH PEDGEKIARL RELMLEQAQI LEFGLAVHEK 
KSSLGIQEFS ACMQASPVHF PNGSPRVCRN SAPASVSPDG 
SSLSSQASAE VSNITGQSES SDEVFNMQPS PSTSSLSSTH 
SDKHKHSREN SCLSPRERPC SAIYPTPVEP SQRMLFNHIG 
RHTTSVSPSP AGRSPLKGSV QSFTPSPVEY HSPGLISNSP 
TSGFENQVNE QSAPLPVPVP VPVPSYGGEE PVRKESKTPP 
IPVTSEPPAL PPKPLAARSS HLENGARRTD PGPRPRPLPR 

Seq ID NO: 62 Nucleotide sequence: 

Nucleic Acid Accession #: fgenesh predictio: 

Coding sequence: 1..2561 (underlined sequent 



CCACTCGCCA 


GGACTCATCT 


2100 


TTCTTCTCTC 


AGCCGGTGCA 


2160 


ACAGTCGGCC 


CCCCTGCCGG 


2220 


GCCAGTGCGC 


AAGGAGAGCA 


2280 


GCGCCCCGTC 


CCGCTACCTC 


2340 


GCCCCCCAAG 


CCTCTGGCAG 


2400 


CCCCGGCCCG 


CGGCCCAGGC 


2460 


ATGTACCTGC 


GATGCATTCT 


2520 


TAGTGTAGGC 


ACTTTAATAA 


2580 


ATTAAATATC 


ATGTTGCACA 


2640 


AAGTATGGCT 


GAATTTCATT 


2700 


CTCCTTTTGT 


ACCTTTTTAT 


2760 


GATATTGTCA 


AACGATACAA 


2820 


CTTGGAGAAA 


GCTTATTAAG 


2880 


CTTCTGTAAA 


TAAGGCAAGG 


2940 


GAGCAAGACA 


GTATACTTAA 


3000 


AAACAAAATA 


CTTCTTTAAG 


3060 


TCTATTTTGT 


GAATTGGTAG 


3120 


TTTCTTTTCA 


AGGTGCAATT 


3180 


CAGAAGAAAT 


GTTTACTTGA 


3240 


TGTCATTGAA 


CAAGAGCAAA 


3300 


AGCAAGAATT 


CAGGAATCAA 


3360 


AAAAACTTAG 


TGAAAATGTA 


3420 


GGCATTAACT 


TTCAGTCCTG 


3480 


GTGGAGTTTG 


TCTTAGCTGA 


3540 


TCTAGCTTGA 


ATATTGCAAC 


3600 


TAGAAGTGTG 


AAAAAAGCAC 


3660 


TTACTGACTG 


TTTATAACAC 


3720 


GTTAAATATT 


TTCCCTCTTT 


3780 


TGGTAGCTTA 


CCTGGAAGGC 


3840 


CAGGTATATG 


CTGTTGAGGC 


3900 


TGTATATAAC 


TGGJTAATTT 


3960 


AACGCAAACA 


GTTGTTGATG 


4020 


GAAAAAGAAT 


ATTTGTACAT 


4080 


TGAATGTCGG 


AAAAATTAAT 


4140 


GAGAAAAGGA 


ACATATGCAT 


4200 


TTGAACTAGC 


TTTTGCTTTT 


4260 


CATATATATG 


TGAAGCTCCA 


4320 


TAAGATGACA 


TGCATAAGAT 


4360 


AGATTTGTTG 


ACTAGTGCTT 


4440 


TTTTTAAAAA 


AAATTTTGGC 


4500 


TATTTCTGAA 


GCAGCTACAT 


4560 


TGTTGGTTGC 


TGCTGAATGG 


4620 


41 


51 




1 

HKLYDLHIiKA 


i 

QNFTEAAYTL 


60 


DRGKCWENGI 


ILCRKIAEQY 


120 


KKFPFFLRNK 


EFVCRGHDYE 


180 


AVTPIPESQE 


VLQREGVPDN 


240 


YLVQSLPGIS 


RWFEVEKREV 


300 


CLNGVIDAAV 


NGGVSRYQEA 


360 


FVPQDMRPLH 


KKLVDQFFVM 


420 


TRVIPRRSPL 


SYPAVNRYSS 


480 


SASPNVTSSA 


PS SARAS PLL 


540 


DGALPRSDPN 


LSAPEKASPA 


600 


VLSGSYSSGI 


SSLSRCSTSE 


660 


PYSVYERTLR 


RPVPLPHSLS 


720 


KVSQL 







correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGGACCGAG GCCAGGGTAA GAGGGGCCGC GACGCCCGCA CTTGTTGCGG CGCCGGGCGG 60 

GAAAGGGAGA CTGGACGATC TGAAGCCGGA GAGGAGGAGG GAGAGAGGCG GGCGGTGGGG 120 

CGGGGGCTGA GGAACGCTCG GAGGGGACTG GGAGACGCGG CGCTTATGCA AAGGTGCCTT 180 

CGGCTGCCGG GACAACCCGC CAGCAACCAG GTACAGCTCT CAGAGGTTCC ACAGAGGAAG 240 

CTCAGGGTCC CTGAATCTCC CAGTGTGGCA GAGAAAGTGA AACTTGGTCA CCGATGCCTG 300 
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GAACTGCTGG AGCAGCTGCT CCCAGAGCTC ACCGGGCTGC TCAGCCTCCT GGACCACGAG 360 

TACCTCAGCG ATACCACCCT GGAAAAGAAG ATGGCCGTGG CCTCCATCCT GCAGAGCCTG 420 

CAGCCCCTTC CAGCAAAGGA GGTCTCCTAC CTGTATGTGA ACACAGCAGA CCTCCACTCG 480 

GGGCCCAGCT TCGTGGAATC CCTCTTTGAA GAATTTGACT GTGACCTGAG TGACCTTCGG 540 

GACATGCCAG AGGATGATGG GGAGCCCAGC AAAGGAGCCA GCCCTGAGCT AGCCAAGAGC 600 

CCACGCCTGA GAAACGCGGC CGACCTGCCT CCACCGCTCC CCAACAAGCC TCCCCCTGAG 660 

GACTACTATG AAGAGGCCCT TCCTCTGGGA CCCGGCAAGT CGCCTGAGTA CATCAGCTCC 720 

CACAATGGCT GCAGCCCCTC ACACTCGATT GTGGATGGCT ACTATGAGGA CGCAGACAGC 780 

AGCTACCCTG CAACCAGGGT GAACGGCGAG CTTAAGAGCT CCTATAATGA CTCTGACGCA 840 

ATGAGCAGCT CCTATGAGTC CTACGATGAA GAGGAGGAGG AAGGGAAGAG CCCGCAGCCC 900 

CGACACCAGT. GGCCCTCAGA GGAGGCCTCC ATGCACCTGG TGAGGGAATG CAGGATATGT 960 

GCCTTCCTGC TGCGGAAAAA GCGTTTCGGG CAGTGGGCCA AGCAGCTGAC GGTCATCAGG 1020 

GAGGACCAGC TCCTGTGTTA CAAAAGCTCC AAGGATCGGC AGCCACATCT GAGGTTGGCA 1080 

CTGGATACCT GCAGCATCAT CTACGTGCCC AAGGACAGCC GGCACAAGAG GCACGAGCTG 1140 

CGTTTCACCC AGGGGGCTAC CGAGGTCTTG GTGCTGGCAC TGCAGAGCCG AGAGCAGGCC 1200 

GAGGAGTGGC TGAAGGTCAT CCGAGAAGTG AGCAAGCCAG TTGGGGGAGC TGAGGGAGTG 1260 

GAGGTCCCCA GATCCCCAGT CCTCCTGTGC AAGTTGGACC TGGACAAGAG GCTGTCCCAA 1320 

GAGAAGCAGA CCTCAGATTC TGACAGCGTG GGTGTGGGTG ACAACTGTTC TACCCTTGGC 1380 

CGCCGGGAGA CCTGTGATCA CGGCAAAGGG AAGAAGAGCA GCCTGGCAGA ACTGAAGGGC 1440 

TCAATGAGCA GGGCTGCGGG CCGCAAGATC ACCCGTATCA TTGGCTTCTC CAAGAAGAAG 1500 

ACACTGGCCG ATGACCTGCA GACGTCCTCC ACCGAGGAGG AGGTTCCCTG CTGTGGCTAC 1560 

CTGAACGTGC TGGTGAACCA GGGCTGGAAG GAACGCTGGT GCCGCCTGAA GTGCAACACT 1620 

CTGTATTTCC ACAAGGATCA CATGGACCTG CGAACCCATG TGAACGCCAT CGCCCTGCAA 1680 

GGCTGTGAGG TGGCCCCGGG CTTTGGGCCC CGACACCCAT TTGCCTTCAG GATCCTGCGC 1740 

AACCGGCAGG AGGTGGCCAT CTTGGAGGCA AGCTGTTCAG AGGACATGGG TCGCTGGCTC 1800 

GGGCTGCTGC TGGTGGAGAT GGGCTCCAGA GTCACTCCGG AGGCGCTGCA CTATGACTAC 1860 

GTGGATGTGG AGACCTTAAC CAGCATCGTC AGTGCTGGGC GCAACTCCTT CCTATATGCA 1920 

AGATCCTGCC AGAATCAGTG GCCTGAGCCC CGAGTCTATG ATGATGTTCC TTATGAAAAG 1980 

ATGCAGGACG AGGAGCCCGA GCGCCCCACA GGGGCCCAGG TGAAGCGTCA CGCCTCCTCC 2040 

TGCAGTGAGA AGTCCCATCG TGTGGACCCG CAGGTCAAAG TCAAACGCCA CGCCTCCAGT 2100 

GCCAATCAAT ACAAGTATGG CAAGAACCGA GCCGAGGAGG ATGCCCGGAG GTACTTGGTA 2160 

GAAAAAGAGA AGCTGGAGAA AGAGAAAGAG ACGATTCGGA CAGAGCTGAT AGCACTGAGA 2220 

CAGGAGAAGA GGGAACTGAA GGAAGCCATT CGGAGCAGCC CAGGAGCAAA ATTAAAGGCT 2280 

CTGGAAGAAG CCGTGGCCAC CCTGGAAGCT CAGTGTCGGG CAAAGGAGGA GCGCCGGATT 2340 

GACCTGGAGC TGAAGCTGGT GGCTGTGAAG GAGCGCTTGC AGCAGTCCCT GGCAGGAGGG 2400 

CCAGCCCTGG GGCTCTCCGT GAGCAGCAAG CCCAAGAGTG GGCAACTCTC TGAGGAAGAT 2460 

ACGCTCACCT CCAATGGTGC TCTCTCAGAG AGAACTTCTC TGACCTCATC TACACCAGGG 2520 
CTTCTCAACC CCAACACTAC TGACATTTTG GACCAGTM 



Seg ID NO: 63 Protein sequence: 

Protein Accession #: fgenesh prediction 



1 11 21 31 - 41 51 

I I I I I I 

MDRGQGKRGR DARTCCGAGR ERETGRSEAG EEEGERRAVG RGLRNARRGL GDAALMQRCL 60 

RLPGQPASNQ VQLSEVPQRK LRVPESPSVA EKVKLGHRCL ELLEQLLPEL TGLLSLLDHE 120 

YLSDTTLEKK MAVASILQSL QPLPAKEVSY LYVNTADLHS GPSFVESLFE EFDCDLSDLR 180 

DMPEDDGEPS KGASPELAKS PRLRNAADLP PPLPNKPPPE DYYEEALPLG PGKSPEYISS 240 

HNGCSPSHSI VDGYYEDADS SYPATRVNGE LKSSYNDSDA MSSSYESYDE EEEEGKSPQP 300 

RHQWPSEEAS MHLVRECRIC AFLLRKKRFG QWAKQLTVIR EDQLLCYKSS KDRQPHLRIA 360 

LDTCSIIYVP KDSRHKRHEL RFTQGATEVL VLALQSREQA EEWLKVIREV SKPVGGAEGV 420 

EVPRSPVLLC KLDLDKRLSQ EKQTSDSDSV GVGDNCSTLG RRETCDHGKG KKSSLAELKG ...480. 

SMSRAAGRKI TRIIGFSKKK TLADDLQTSS TEEEVPCCGY LNVLVNQGWK ERWCRLKCNT 540 " 

LYFHKDHMDL RTHVNAIALQ GCEVAPGFGP RHPFAFRILR NRQEVAILEA SCSEDMGRWL 600 

GLLLVEMGSR VTPEALHYDY VDVETLTSIV SAGRNSFLYA RSCQNQWPEP RVYDDVPYEK 660 

MQDEEPERPT GAQVKRHASS CSEKSHRVDP QVKVKRHASS ANQYKYGKNR AE EDARRYLV 720 

EKEKLEKEKE TIRTELIALR QEKRELKEAI RSSPGAKLKA LEEAVATLEA QCRAKEERRI 780 

DIiELKLVAVK ERLQQSLAGG PALGLSVSSK PKSGQLSEED TLTSNGALSE RTSLTSSTPG 840 
LLNPNTTDIL DQ 

Seq ID NO: 64 Nucleotide sequence: 
Nucleic Acid Accession #: NM_004126.1 

Coding sequence: 108-129 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCGAAAATg CCTGCCCTTC 120 

ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 

AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240 

AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 

AAGAAAAAGG CAGCTGTGTT ATTTC ATAA A TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 

AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 
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TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 

GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 

ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT 

Seq ID NO: 65 Protein sequence: 
Protein Accession #: HP 004117 



10 



15 



20 



25 



30 



35. 



40 



45 



50 



55 



1 11 21 31 41 51 

I I I I I I 

MPALHIEDLP EKEKLKMEVE QLRKEVKLQR QQVSKCSEEI KNYIEERSGE DPI/VKGIPED 

KNPFKEKGSC VIS 



60 



Seq ID NO: 66 Nucleotide sequence: 
Nucleic Acid Accession #: NM_003842.I 

Coding sequence: 1-1236 (underlined sequences correspond to start and stop codons) 



ATGGAACAAC 
GGACCCAGGG 
GTTGTCGCCG 
CTAGCTCCCC 
TGTCCACCTG 
CAGGACTATA 
TCAGGTGAAG 
GAAGGCACCT 
CCCAGAGGGA 
AAAGAATCAG 
TTTGTTTGCA 
GGTGGTGGTG 
AATGTCCTCA 
GAAGTCCAGG 
CATCTGCTGG 
AATGAAGGTG 
CCCTTTGACT 
GTGGCTAAAG 
GTCAACAAAA 
GGAGAGAGAC 
TATCTAGAAG 



11 

I 

GGGGACAGAA 
AGGCGCGGGG 
CGGTCCTGCT 
AGCAGAGAGC 
GACACCATAT 
GCACTCACTG 
TGGAGCTAAG 
TCCGGGAAGA 
TGGTCAAGGT 
GCATCATCAT 
AGTCTTTACT 
GGGACCCTGA 
ATGAGATCGT 
AGCCAGCAGA 
AACCGGCAGA 
ATCCCACTGA 
CCTGGGAGCC 
CTGAGGCAGC 
CCGGGCGAGA 
TTGCCAAGCA 
GTAATGCAGA 



21 
I 

CGCCCCGGCC 
AGCCAGGCCT 
GTTGGTCTCA 
GGCCCCACAA 
CTCAGAAGAC 
GAATGACCTC 
TCCCTGCACC 
AGATTCTCCT 
CGGTGATTGT 
AGGAGTCACA 
GTGGAAGAAA 
GCGTGTGGAC 
GAGTATCTTG 
GCCAACAGGT 
AGCTGAAAGG 
GACTCTGAGA 
GCTCATGAGG 
GGGCCACAGG 
TGCCTCTGTC 
GAAGATTGAG 
CTCTGCCATG 



31 

I 

GCTTCGGGGG 
GGGCCCCGGG 
GCTGAGTCTG 
CAAAAGAGGT 
GGTAGAGATT 
CTTTTCTGCT 
ACGACCAGAA 
GAGATGTGCC 
ACACCCTGGA 
GTTGCAGCCG 
GTCCTTCCTT 
AGAAGCTCAC 
CAGCCCACCC 
GTCAACATGT 
TCTCAGAGGA 
CAGTGCTTCG 
AAGTTGGGCC 
GACACCTTGT 
CACACCCTGC 
GACCACTTGT 
TCCTAA 



Seq ID NO: 67 Protein sequence: 
Protein Accession #: NP 003833.1 



1 
I 

MEQRGQNAPA 
IiAPQQRAAPQ 
SGEVEIiSPCT 
KESGIIIGVT 
NVLNEIVSIL 
NEGDPTETLR 
VNKTGRDASV 



11 
I 

ASGARKRHGP 
QKRSSPSEGL 
TTRNTVCQCE 
VAAWLiVAV 
QPTQVPEQEM 
QCFDDFADLV 
HTLLDALETL 



21 
I 

GPREARGARP 
CPPGHHISED 
EGTFREEDSP 
FVCKSLLWKK 
EVQEPAEPTG 
PPDSWEPLMR 
GERLAKQKIE 



31 
I 

GPRVPKTLVL 
GRDCISCKYG 
EMCRKCRTGC 
VLPYLKGICS 
VNMLSPGESE 
KLGLMDNEIK 
DHLLSSGKFM 



A1 


51 




1 

CCCGGAAAAG 


1 

GCACGGCCCA 


60 


TCCCCAAGAC 


CCTTGTGCTC 


120 


CTCTGATCAC 


CCAACAAGAC 


180 


CCAGCCCCTC 


AGAGGGATTG 


240 


GCATCTCCTG 


CAAATATGGA 


300 


TGCGCTGCAC 


CAGGTGTGAT 


360 


ACACAGTGTG 


TCAGTGCGAA 


420 


GGAAGTGCCG 


CACAGGGTGT 


480 


GTGACATCGA 


ATGTGTCCAC 


540 


TAGTCTTGAT 


TGTGGCTGTG 


600 


ACCTGAAAGG 


CATCTGCTCA 


'660 


AACGACCTGG 


GGCTGAGGAC 


720 


AGGTCCCTGA 


GCAGGAAATG 


780 


TGTCCCCCGG 


GGAGTCAGAG 


840 


GGAGGCTGCT 


GGTTCCAGCA 


900 


ATGACTTTGC 


AGACTTGGTG 


960 


TCATGGACAA 


TGAGATAAAG 


.1020 


ACACGATGCT 


GATAAAGTGG 


1080 


TGGATGCCTT 


GGAGACGCTG 


1140 


TGAGCTCTGG 


AAAGTTCATG 


1200 


41 


51 




1 

WAAVLLLVS 


1 

AESALITQQD 


60 


QDYSTHWNDL 


LFCLRCTRCD 


120 


PRGMVKVGDC 


TPWSDIECVH 


180 


GGGGDPERVD 


RSSQRPGAED 


240 


HLLEPAEAER 


SQRRRLLVPA 


300 


VAKAEAAGHR 


DTLYTMLIKW 


360 


YLEGNADSAM 


S 





60 



Seq ID NO: 68 Nucleotide aecruence: 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 361- 2220 (underlined sequences correspond to start and stop codons) 



65 



70 



75 



GGCACCATCT 
TGTCGCTGCC 
GGTTACACTG 
GCTGAGACGT 
TGCGAACACG 
GGTCTCAGCT 
ATGAACGGGG 
CCGCAGGACA 
TGCCAGGCTA 
AGTCTTTGTC 
GCCATCGCCT 
AACTGCTCTG 
TGTGCCCATG 



11 
I 

GCTCCCTGCC 
ACAACGGCGG 
GGGATCGGTG 
GCGACTGCGC 
GCTTCACTGG 
GCCAGGCCCC 
AGTGCTCCTG 
CGCATGGGCC 
CCAGCGGCCT 
CTCCTGACAC 
GCTCACCCAT 
TGCCCTGCCC 
AGGCAGTCTG 



21 
I 

CTGCCCAGAG 
CCTCTGTGAC 
CCGGGAGGAG 
CCCGGACGCC 
GGACCGCTGC 
CTGCACCTGC 
CCTGCCGGGC 
AGGGTGCCAG 
CTGTCAGTGC 
CTACGGTGTC 
CGAOGGCGAG 
ACCCGGAACC 
CAGCCCCCAA 



31 

I 

GGCTTTCACG 
CGATTCACTG 
TGCCCGGTGG 
CGTTGCTTCC 
ACGGATCGCC 
GACCGGGAGC 
TGGGCGGGCC 
GAGCACTGTC 
GCGCCGGGTT 
AACTGTTCTG 
TGCGTCTGCA 
TGGGGCTTCA 
ACTGGAGCCT 



41 

r 

GACCCAACTG 
GGCAGTGCCG 
GCCGCTTTGG 
CGGCCAACGG 
TCTGCCCCGA 
ACAGCCTCAG 
TCCACTGCAA 
TCTGCCTGCA 
ACACGGGCCC 
CACGCTGCTC 
AGGAAGGTTG 
GTTGCAATGC 
GTACCTGCAC 



51 
I 

CTCCCAGGAA 
CTGCGCTCCG 
GCAGGACTGT 
CGCATGTCTG 
CGGCTTCTAC 
CTGCCACCCG 
CGAGAGCTGC 
CGGTGGCGTC 
TCACTGTGCT 
ATGTGAAAAT 
GCAGCGTGGT 
CAGCTGCCAG 
CCCTGGGTGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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10 



15 



20 



25 



CATGGGGCCC 
CGCTGTGACT 
GCTGGCTGGA 
TGTAGCAACA 
GTGTGTGCAC 
GGCAAACGCT 
ACCTGCTACT 
ACATTTGGTG 
GAGACTGGGG 
CAGGAGCCCT 
ATTGGCATTG 
CGGCACTGGC 
CTGGACGGCT 
AACCCCAGCT 
CCAGGCCCGC 
GATAACCACA 
CTGGACAGGG 
GGCCCATTCT 
CTGAGCAGTG 
CGGGAGAGCA 
CCTCAGTTCT 
TACGAGCAGC 
CCTCCGGGCC 
TATGACTTGC 



ACTGCCAGCT 
GTGACCACTC 
TGGGTGCCCG 
CCTGCACCTG 
CCGGATTCCG 
GTGTGCCCTG 
GCCTGGCTGG 
CTAACTGCTC 
CCTGTGTATG 
TTACTGTGAT 
CAGTGCTGGG 
AAAAAGGCAA 
CCGAGTATGT 
ACCACACCCT 
TCTTTGCCAG 
CCACCCTGCC 
GGAGCAGCCG 
ACAATAAAGG 
AGAACCCATA 
GCTACATGGA 
GGGACAGCCA 
CCAGCCCCCT 
TACCCCCCGG 
CTCCAGTACG 



GCCCTGTCCG 
TGATGGCTGT 
CTGCCACCTG 
CAAGAATGGG 
GGGCCCCTCC 
CAAGTGCGCT 
CTGGACAGGC 
CCAGCCATGC 
TCCCCCAGGG 
GCCGACCACT 
GTCCCTTGTG 
GGAGCACCAC 
CATGCCAGAT 
GTCGCAGTGC 
CCTGCAGAAC 
TGCTGACTGG 
CCTGGACCGA 
GCTCATCTCT 
TGCCACCATC 
GATGAAAGGC 
GAGGCGGCGG 
GATCCATGAC 
CCACTATGAC 
GCATCCCCCA 



AAGGGGCAGT 
GACCCTGTTC 
TCCTGCCCTG 
GGCACCTGTC 
TGCCAGAGAT 
AACCACTCCT 
CCCGACTGCT 
CAGTGTGGTC 
CACAGTGGTG 
CCAGTAGCGT 
GTAGCCCTGG 
CACCTGGCTG 
GTCCCTCCGA 
TCCCCAAACC 
CCTGAGCGGC 
AAGCACCGCC 
AGCTACAGCT 
GAAGAGGAGC 
CGGGACCTGC 
CCTCCCTCAG 
CAACCCCAGC 
CGAGACTCTG 
TCACCCAAGA 
TCACCTCCAC 



TTGGAGAAGG 
ATGGACGCTG 
AGGGCTTATG 
TCCCTGAGAA 
CCTGTCAGCC 
TCTGCCACCC 
CCCAGCGCTG 
CTGGAGAAAA 
CACCTTGCAG 
ATAACTCGCT 
TGGCACTGTT 
TGGCTTACAG 
GCTACAGTCA 
CCCCACCCCC 
CAGGTGGGGC 
GGGAGCCCCC 
ATAGCTACAG 
TCGGGGCCAG 
CCAGCTTGCC 
GATCTCCCCC 
CACAGAGAGA 
TGGGCTCCCA 
ACAGCCACAT 
TTCGACGCCA 



TTGTGCCAGT 
TCAGTGCCAG 
GGGAGTCAAC 
TGGCAACTGC 
TGGCCGCTAT 
CTCGAACGGG 
CCCTCTGGGG 
GTGCCACCCA 
GATTGGAATC 
GGGTGCAGTG 
CATTGGCTAT 
CAGCGGGCGC 
CTACTACTCC 
TAACAAGGTT 
CCAAGGGCAT 
TCCAGGGCCT 
CAATGGCCCA 
TGTGGCTTCC 
AGGGGGCCCC 
CAGGCAGCCT 
CAGTGGCACC 
GCCCCCTCTG 
CCCTGGACAT 
GGACCGTTGA 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



Seq ID NO: 69 Protein sequence: 

Protein Accession #: FGENESH prediction 



30 



35 



40 



45 



GTICSLPCPE 
AETCDCAPDA 
MNGECSCLPG 
SLCPPDTYGV 
CAHEAVCSPQ 
AGWMGARCHL 
GKRCVPCKCA 
ETGACVCPPG 
RHWQKGKEHH 
PGPLFASLQN 
GPFYNKGLIS 
PQFWDSQRRR 
YDLPPVRHPP 



11 
I 

GFHGPNCSQE 
RCFPANGACL 
WAGLHCNESC 
NCSARCSCEN 
TGACTCTPGW 
SCPEGLWGVN 
NHSFCHPSNG 
HSGAPCRIGI 
HLAVAYSSGR 
PERPGGAQGH 
EEELGASVAS 
QPQPQRDSGT 
SPPLRRQDR 



21 
I 

CRCHNGGLCD 
CEHGFTGDRC 
PQDTHGPGCQ 
AIACSPIDGE 
HGAHCQLPCP 
CSNTCTCKNG 
TCYCLAGWTG 
QEPFTVMPTT 
LDGSEYVMPD 
DNHTTIiPADW 
LSSENPYATI 
YEQPSPLIHD 



31 
I 

RFTGQCRCAP 
TDRLCPDGFY 
EHCLCLHGGV 
CVCKEGWQRG 
KGQFGEGCAS 
GTCLPENGNC 
PDCSQRCPLG 
PVAYNSLGAV 
VPPSYSHYYS 
KHRREPPPGP 
RDLPSLPGGP 
RDSVGSQPPL 



41 
I 

GYTGDRCREE 
GLSCQAPCTC 
CQATSGLCQC 
NCSVPCPPGT 
RCDCDHSDGC 
VCAPGFRGPS 
TFGANCSQPC 
IGIAVLGSLV 
NPSYHTLSQC 
LDRGSSRLOR 
RESSYMEMKG 
PPGIiPPGHYD 



51 
I 

CPVGRFGQDC 
DREHSLSCHP 
APGYTGPHCA 
WGFSCNASCQ 
DPVHGRCQCQ 
CQRSCQPGRY 
QCGPGEKCHP 
VALVALFIGY 
SPNPPPPNKV 
SYSYSYSNGP 
PPSGSPPRQP 
SPKNSHIPGH 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



50 



55 



60 



65 



70 



75 



Seq ID NO: 70 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005458 

Coding sequence: 1. .2826 {underlined sequences correspond to start and stop codons) 



ATGGCTTCCC 
GCGCGCCTGC 
GGCTGGGCGC 
CTCATGCCGC 
GTGGAACTGG 
CTGCGGCTCT 
ATAAAATACG 
ATCATTGCAG 
CCTGTTCTAG 
GCGGTGAATC 
CTGACGCAAG 
GGCGAGGACA 
AAAAAGCTGA 
GCAAAAGTGT 
ATTCCGGGCT 
CGCTGCCTCC 
CCCCTGAGCT 
GAGTACAACA 
GGCATCTGGG 
CGGCACCAGC 
AATGCCATGA 
GAGAGAATGG 
GAGTACAACG 
TCCGAACCAC 



11 

I 

CGCGGAGGTC 
TACTGCTACT 
GGGGCGCCCC 
TCACCAAGGA 
CCATCGAGCA 
ATGACACGGA 
GGCCGAACCA 
AGTCCCTCCA 
CCGATAAGAA 
CAGCCATTCT 
ACGTTCAGAG 
TTGAGATTTC 
AGGGGAATGA 
TCTGTTGTGC 
GGTACGAGCC 
GGAAGAATCT 
CCAAGCAGAT 
ACAAGCGGTC 
TCATCGCCAA 
GGATCCAGGA 
ACGAGACCAA 
GGACCATTAA 
CTGTGGCCGA 
CAAAAGACAA 



21 
I 

CGGGCAGCCA 
GCTGCTGCCG 
CCGGCCGCCG 
GGTGGCCAAG 
GATCCGCAAC 
GTGCGACAAC 
CTTGATGGTG 
AGGCTGGAAT 
AAAATACCCT 
GAAGTTGCTC 
GTTCTCTGAG 
AGACACCGAG 
TGTGCGGATC 
ATACGAGGAG 
TTCTTGGTGG 
GCTTGCTGCC 
CAAGACCATC 
AGGCGTGGGG 
GACACTGCAG 
CTTCAACTAC 
CTTCTTCGGG 
ATTTACTCAA 
CACACTGGAG 
GACCATCATC 



31 
I 

GGGCGGCCGC 
CTGCTGCTGC 
CCCAGCAGCC 
GGCAGCATCG 
GAGTCACTCC 
GCAAAAGGGT 
TTTGGAGGCG 
CTGGTGCAGC 
TATTTCTTTC 
AAGCACTACC 
GTGCGGAATG 
AGCTTCTCCA 
ATCCTTGGCC 
AACATGTATG 
GAGCAGGTGC 
ATGGAGGGCT 
TCAGGAAAGA 
CCCAGCAAGT 
AGGGCCATGG 
ACGGACCACA 
GTCACGGGTC 
TTTCAAGACA 
ATCATCAATG 
CTGGAGCAGC 



41 

I 

CGCCGCCGCC 
CTCTGGCGCC 
CGCCGCTCTC 
GGCGCGGTGT 
TGCGCCCCTA 
TGAAAGCCTT 
TCTGTCCATC 
TTTCTTTTGC 
GGACCGTCCC 
AGTGGAAGCG 
ACCTGACTGG 
ACGATCCCTG 
AGTTTGACCA 
GTAGTAAATA 
ACACGGAAGC 
ACATTGGCGT 
CTCCACAGCA 
TCCACGGGTA 
AGACACTGCA 
CGCTGGGCAG 
AAGTTGTATT 
GCAGGGAGGT 
ACACCATCAG 
TGCGGAAGAT 



51 
I 

ACCGCCGCCC 
CGGGGCCTGG 
CATCATGGGC 
GCTCCCCGCC 
CTTCCTCGAC 
CTACGATGCA 
CGTCACATCC 
TGCAACCACG 
ATCAGACAAT 
CGTGGGCACG 
AGTTCTGTAT 
TACCAGTGTC 
GAATATGGCA 
TCAGTGGATC 
CAACTCATCC 
GGATTTCGAG 
GTATGAGAGA 
CGCCTACGAT 
TGCCAGCAGC 
GAT CATC CTC 
CCGGAATGGG 
GAAGGTGGGA 
GTTCCAAGGA 
CTCCCTACCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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CTCTACAGCA TCCTCTCTGC CCTCACCATC 
TTCTTCAACA TCAAGAACCG GAATCAGAAG 
AACCTTATCA TCCTTGGAGG GATGCTCTCC 
GGATCCTTTG TCTCTGAAAA GACCTTTGAA 
ACCGTGGGCT ACACGACCGC TTTTGGGGCC 
ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG 
GTGGGGGGCA TGCTGCTGAT CGACCTGTGT 
CTGCGAAGGA CAGTGGAGAA GTACAGCATG 
ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC 
TATGCCTACA AGGGACTTCT CATGTTGTTC 
GTCAGCATCC CCGCACTCAA CGACAGCAAG 
ATCATGTGCA TCATCGGGGC CGCTGTCTCC 
TTCTGCATCG TGGCTCTGGT CATCATCTTC 
GTGCCGAAGC TCATCACCCT GAGAACAAAC 
CAGTTCACTC AGAATCAGAA GAAAGAAGAT 
AACCAAGCCA GCACATCCCG CCTGGAGGGC 
AAGATCACAG AGCTGGATAA AGACTTGGAA 
GAAAAGACCA CCTACATTAA ACAGAACCAC 
GGAAACTTCA CTGAGAGCAC AGATGGAGGA 
AATCCCCAGC TACAGTGGAA CACAACAGAG 
GATATAAACT CTCCAGAACA CATCCAGCGT 
CACGCCTACC TCCCATCCAT CGGAGGCGTG 
CCCACCGCCA GCCCCCGCCA CAGACATGTG 
CTGTAA 



CTCGGGATGA TCATGGCCAG TGCTTTTCTC 1500 

CTCATAAAGA TGTCGAGTCC ATACATGAAC 1560 

TATGCTTCCA TATTTCTCTT TGGCCTTGAT 1620 

ACACTTTGCA CCGTCAGGAC CTGGATTCTC 1680 

ATGTTTGCAA AGACCTGGAG AGTCCACGCC 1740 

ATCATCAAGG ACCAGAAACT GCTTGTGATC 1600 

ATCCTGATCT GCTGGCAGGC TGTGGACCCC 1860 

GAGCCGGACC CAGCAGGACG GGATATCTCC 1920 

ACCCATATGA CCATCTGGCT TGGCATCGTC 1980 

GGTTGTTTCT TAGCTTGGGA GACCCGCAAC 2040 

TACATCGGGA TGAGTGTCTA CAACGTGGGG 2100 

TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160 

TGCAGCACCA TCACCCTCTG CCTGGTATTC 2220 

CCAGATGCAG CAACGCAGAA CAGGCGATTC 2280 

TCTAAAACGT CCACCTCGGT CACCAGTGTG 2340 

CTACAGTCAG AAAACCATCG CCTGCGAATG 2400 

GAGGTCACCA TGCAGCTGCA GGACACACCA 2460 

TACCAAGAGC TCAATGACAT CCTCAACCTG 2520 

AAGGCCATTT TAAAAAATCA CCTCGATCAA 2S80 

CCCTCTCGAA CATGCAAAGA TCCTATAGAA 2640 

CGGCTGTCCC TCCAGCTCCC CATCCTCCAC 2700 

GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 2760 

CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 2820 



Seq ID NO: 71 Protein sequence : 
Protein Accession #: NP_005449 

1 11 21 31 41 51 

III III 
MASPRRSGQP GRPPPPPPPP ARLLLLLLLP LLLPLAPGAW GWARGAPRPP PSSPPIiSIMG 
LMPLTKEVAK GSIGRGVLPA VELAIEQIRN ESLLRPYFLD LRLYDTECDN AKGLKAFYDA 
IKYGPNHLMV FGGVCPSVTS IIAESLQGWN LVQLSFAATT PVLADKKKYP YFFRTVPSDN 
AVNPAILKLL KHYQWKRVGT LTQDVQRFSE VRNDLTGVLY GEDIEISDTE SFSNDPCTSV 
KKLKGNDVRI ILGQFDQNMA AKVFCCAYEE NMYGSKYQWI IPGWYEPSWW EQVHTEANSS 
RCLRKNLLAA KEGYIGVDFE PLSSKQIKTI SGKTPQQYBR EYNNKRSGVG PSKFHGYAYD 
GIWVIAKTLQ RAMETLHASS RHQRIQDFNY TDHTLGRIIIj NAMNETNFFG VTGQWFRNG 
ERMGTIKFTQ FQDSREVKVG EYNAVADTLE IINDTIRFQG SEPPKDKTII LEQLRKISLP 
LYSILSALTI LGMIMASAFL FFNIKNRNQK LIKMSSPYMN NLIILGGMLS YASIFLFGLD 
GSFVSEKTFE TLCTVRTWIL TVGYTTAFGA MFAKTWRVHA IFKNVKMKKK IIKDQKLLVI 
VGGMLLIDLC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLEHCEN THMTIWLGIV 
YAYKGLLMLF GCFLAWETRN VSIPALNDSK YIGMSVYNVG IMCIIGAAVS FLTPJDQPNVQ 
FCIVALVIIF CSTITLCLVF VPKLITLRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV 
NQASTSRLEG LQSENHRLRM KITELDKDLE EVTMQLQDTP EKTTYIKQNH YQELNDILNL 
GNFTESTDGG KAILKNHLDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH 
HAYIiPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L 

Seq ID NO: 72 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005795 
Coding sequence: 522-1940 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GCACGAGGGA ACAACCTCTC TCTCTSCAGC AGAGAGTGTC ACCTCCTGCT TTAGGACCAT 60 

CAAGCTCTGC TAACTGAATC TCATCCTAAT TGCAGGATCA CATTGCAAAG CTTTCACTCT 120 

TTCCCACCTT GCTTGTGGGT AAATCTCTTC TGCGGAATCT CAGAAAGTAA AGTTCCATCC 180 

TGAGAATATT TCACAAAGAA TTTCCTTAAG AGCTGGACTG GGTCTTGACC CCTGGAATTT 240 

AAGAAATTCT TAAAGACAAT GTCAAATATG ATCCAAGAGA AAATGTGATT TGAGTCTGGA 300 

GACAATTGTG CATATCGTCT AATAATAAAA ACCCATACTA GCCTATAGAA AACAATATTT 360 

GAATAATAAA AACCCATACT AGCCTATAGA AAACAATATT TGAAAGATTG CTACCACTAA 420 

AAAGAAAACT ACTACAACTT GACAAGACTG CTGCAAACTT CAATTGGTCA CCACAACTTG 480 

ACAAGGTTGC TATAAAACAA GATTGCTACA ACTTCTAGTT TATGT TATAC AGCATATTTC 540 

ATTTGGGCTT AATGATGGAG AAAAAGTGTA CCCTGTATTT TCTGGTTCTC TTGCCTTTTT 600 

TTATGATTCT TGTTACAGCA GAATTAGAAG AGAGTCCTGA GGACTCAATT CAGTTGGGAG 660 

TTACTAGAAA TAAAATCATG ACAGCTCAAT ATGAATGTTA CCAAAAGATT ATGCAAGACC 720 

CCATTCAACA AGCAGAAGGC GTTTACTGCA ACAGAACCTG GGATGGATGG CTCTGCTGGA 780 

ACGATGTTGC AGCAGGAACT GAATCAATGC AGCTCTGCCC TGATTACTTT CAGGACTTTG 840 

ATCCATCAGA AAAAGTTACA AAGATCTGTG ACCAAGATGG AAACTGGTTT AGACATCCAG 900 

CAAGCAACAG AACATGGACA AATTATACCC AGTGTAATGT TAACACCCAC GAGAAAGTGA 960 

AGACTGCACT AAATTTGTTT TACCTGACCA TAATTGGACA CGGATTGTCT ATTGCATCAC 1020 

TGCTTATCTC GCTTGGCATA TTCTTTTATT TCAAGAGCCT AAGTTGCCAA AGGATTACCT 1080 

TACACAAAAA TCTGTTCTTC TCATTTGTTT GTAACTCTGT TGTAACAATC ATTCACCTCA 1140 

CTGCAGTGGC CAACAACCAG GCCTTAGTAG CCACAAATCC TGTTAGTTGC AAAGTGTCCC 1200 

AGTTCATTCA TCTTTACCTG ATGGGCTGTA ATTACTTTTG GATGCTCTGT GAAGGCATTT 1260 

ACCTACACAC ACTCATTGTG GTGGCCGTGT TTGCAGAGAA GCAACATTTA ATGTGGTATT 1320 



60 
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180 
240 
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840 
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ATTTTCTTGG CTGGGGATTT CCACTGATTC CTGCTTGTAT ACATGCCATT GCTAGAAGCT 13 80 

TATATTACAA TGACAATTGC TGGATCAGTT CTGATACCCA TCTCCTCTAC ATTATCCATG 1440 

GCCCAATTTG TGCTGCTTTA CTGGTGAATC TTTTTTTCTT GTTAAATATT GTACGCGTTC 1500 

TCATCACCAA GTTAAAAGTT ACACACCAAG CGGAATCCAA TCTGTACATG AAAGCTGTGA 1560 

GAGCTACTCT TATCTTGGTG CCATTGCTTG GCATTGAATT TGTGCTGATT CCATGGCGAC 1620 

CTGAAGGAAA GATTGCAGAG GAGGTATATG ACTACATCAT GCACATCCTT ATGCACTTCC 1680 

AGGGTCTTTT GGTCTCTACC ATTTTCTGCT TCTTTAATGG AGAGGTTCAA GCAATTCTGA 1740 

GAAGAAACTG GAATCAATAC AAAATCCAAT TTGGAAACAG CTTTTCCAAC TCAGAAGCTC 1800 

TTCGTAGTGC GTCTTACACA GTGTCAACAA TCAGTGATGG TCCAGGTTAT AGTCATGACT 1860 

GTCCTAGTGA ACACTTAAAT GGAAAAAGCA TCCATGATAT TGAAAATGTT CTCTTAAAAC 1920 

CAGAAAATTT ATATAA TTGA AAATAGAAGG ATGGTTGTCT CACTGTTTGG TGCTTCTCCT 1980 

AACTCAAGGA CTTGGACCCA TGACTCTGTA GCCAGAAGAC TTCAATATTA AATGACTTTG 2040 

GGGAATGTCA TAAAGAAGAG CCTTCACATG AAATTAGTAG TGTGTTGATA AGAGTGTAAC 2100 

ATCCAGCTCT ATGTGGGAAA AAAGAAATCC TGGTTTGTAA TGTTTGTCAG TAAATACTCC 2160 

CACTATGCCT GATGTGACGC TACTAACCTG ACATCACCAA GTGTGGAATT GGAGAAAAGC 2220 

ACAATCAACT TTTCTGAGCT GGTGTAAGCC AGTTCCAGCA CACCATTGAT GAATTCAAAC 2280 

AAATGGCTGT AAAACTAAAC ATACATGTTG GGCATGATTC TACCCTTATT CSCCCCAAGA 2340 

GACCTAGCTA AGGTCTATAA ACATGAAGGG AAAATTAGCT TTTAGTTTTA AAACTCTTTA 2400 

TCCCATCTTG ATTGGGGCAG TTGACTTTTT TTTTTTCCCA GAGTGCCGTA GTCCTTTTTG 2460 

TAACTACCCT CTCAAATGGA CAATACCAGA AGTGAATTAT CCCTGCTGGC TTTCTTTTCT 2520 

CTATGAAAAG CAACTGAGTA CAATTGTTAT GATCTACTCA TTTGCTGACA CATCAGTTAT 2580 

ATCTTGTGGC ATATCCATTG TGGAAACTGG ATGAACAGGA TGTATAATAT GCAATCTTAC 264 0 

TTCTATATCA TTAGGAAAAC ATCTTAGTTG ATGCTACAAA ACACCTTGTC AACCTCTTCC 2700 

TGTCTTACCA AACAGTGGGA GGGAATTCCT AGCTGTAAAT ATAAATTTTG TCCCTTCCAT 2760 

TTCTACTGTA TAAACAAATT AGCAATCATT TTATATAAAG AAAATCAATG AAGGATTTCT 2820 

TATTTTCTTG GAATTTTGTA AAAAGAAATT GTGAAAAATG AGCTTGTAAA TACTCCATTA 2880 

TTTTATTTTA TAGTCTCAAA TCAAATACAT ACAACCTATG TAATTTTTAA AGCAAATATA 2940 

TAATGCAACA ATGTGTGTAT GTTAATATCT GATACTGTAT CTGGGCTGAT TTTTTAAATA 3000 

AAATAGAGTC TGGAATGCTA TATTTGGTAA ATATTTTAAA GACAACCAGA TGCCAGCATC 3060 

AGAAGTCTGT TTGAGAACTA AGAGAACAGA AACATCTATC ATAAGATATA TTTATTTTAA 3120 

AAACACAAGG TCACTATTTT ACTGAATATA TTTGTTTTGA TAACTCATAC CTTAATAATA 3180 

GGTGTGTTTG ACATATTTCT TTTTTCATTT TGACAATGAA CTCACATTCT AATCCAGAAA 3240 

TTTTAAACAA CTACTGTGAT AAATACCAAT CTGCTACTTT TATAGATTTT ACCCCATTAA 3300 

AATATTACTT TACTGACTTT TACTATGTGA AGATATATAG CTTTGGAAAT GTCCCAGGCT 3360 

ATTCAAGAAA TATAAAAAAC TAGAAGGATA CTATATATAC CATATACAAT GCTTTAATAT 3420 

TTTAATAGAG CTACTGTATA TAATACAAAT TAGGGAAATA CTTGAATATA TCATTGAGAA 3480 

AAAATTATTG TCAGATCTTA CTGAATTATT GTCAGACTTT ATTAAATAAA GATAGAAGAA 3540 

AACCTTGCTA ATGAATTAAA GTGAAATTTG CATGGGATTC AGTTTCTCTA ATGTTATTTT 3600 

CCGCTGAAAT CTCTAAAGAA CAAGAATGAC TTCAATTAGT AAAAGTCAAT TTTGGGAAAA 3660 

GTCATGGGTA TCTGTTTTTT AAGTGTGTCA ATCTGATTAA AATGGATGAA ACAAATTACT 3720 

CATCATAAGT TGTTTCTTAA GCTGTCAATA TGTCAATAGA TGGTGAGTTC AGAACTTATT 3780 

TCAAATTGCT AAGACAAATT ATCTAAATTC GTAAGAATTA ACATATAGAA TGGTCTGGTC 3840 

AGTACATTTA TAATTTATCT ATGCATGAAA AAGTATTGTT TTGTTTGAAA CATGAATTTC 3900 
ATAGCAAGCT GCCATAGAAA GGA 

Seq ID NO: 73 Protein sequence,! 
Protein Accession #: NM 005795 



1 11 21 31 41 51 

I I I I I I 

MLYSIPHLGL MMEKKCTLYF LVLLPFFMIL VTAELEESPE DSIQLGVTRN KIMTAQYECY 
QKIMQDPIQQ AEGVYCNRTW DGWLCWNDVA AGTESMQLCP DYFQDFDPSE KVTKICDQDG 
NWFRHPASNR TWTNYTQCNV NTHEKVKTAL NLFYLTIIGH GLSIASLLIS LGIFFYFKSL 
SCQRITLHKN LFFSFVCNSV VTIIHLTAVA NNQAIiVATNP VSCKVSQFIH LYLMGCNYFW 
MLCEGIYIiHT LIWAVFAEK QHLMWYYFLG WGFPLIPACI HAIARSLYYN DNCWISSDTH 
LLYIIHGPIC AALLVNLFFL LNIVRVLITK LKVTHQAESN LYMKAVRATL ILVPIiLGIEF 
VLIPWRPEGK IAEEVYDYIM HILMHFQGLL VSTIFCFFNG EVQAILRRNW NQYKIQFGNS 
FSNSEALRSA SYTVSTISDG PGYSHDCPSE HLNGKSIHDI ENVLLKPENL YN 

Seq ID NO: 74 Nucleotide sequence ;_ 
Nucleic Acid Accession #: NM_000450.1 
Coding sequence: 117.. 1949 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCTGAGACAG AGGCAGCAGT GATACCCACC TGAGAGATCC TGTGTTTGAA CAACTGCTTC 60 

CCAAAACGGA AAGTATTTCA AGCCTAAACC TTTGGGTGAA AAGAACTCTT GAAGTCATGA 120 

TTGCTTCACA GTTTCTCTCA GCTCTCACTT TGGTGCTTCT CATTAAAGAG AGTGGAGCCT 180 

GGTCTTACAA CACCTCCACG GAAGCTATGA CTTATGATGA GGCCAGTGCT TATTGTCAGC 240 

AAAGGTACAC ACACCTGGTT GCAATTCAAA ACAAAGAAGA GATTGAGTAC CTAAACTCCA 300 

TATTGAGCTA TTCACCAAGT TATTACTGGA TTGGAATCAG AAAAGTCAAC AATGTGTGGG 360 

TCTGGGTAGG AACCCAGAAA CCTCTGACAG AAGAAGCCAA GAACTGGGCT CCAGGTGAAC 420 

CCAACAATAG GCAAAAAGAT GAGGACTGCG TGGAGATCTA CATCAAGAGA GAAAAAGATG 460 

TGGGCATGTG GAATGATGAG AGGTGCAGCA AGAAGAAGCT TGCCCTATGC TACACAGCTG 540 

CCTGTACCAA TACATCCTGC AGTGGCCACG GTGAATGTGT AGAGACCATC AATAATTACA 600 
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CTTGCAAGTG TGACCCTGGC TTCAGTGGAC TCAAGTGTGA GCAAATTGTG AACTGTACAG 660 

CCCTGGAATC CCCTGAGCAT GGAAGCCTGG TTTGCAGTCA CCCACTGGGA AACTTCAGCT 720 

ACAATTCTTC CTGCTCTATC AGCTGTGATA GGGGTTACCT GCCAAGCAGC ATGGAGACCA 780 

TGCAGTGTAT GTCCTCTGGA GAATGGAGTG CTCCTATTCC AGCCTGCAAT GTGGTTGAGT 840 

GTGATGCTGT GACAAATCCA GCCAATGGGT TCGTGGAATG TTTCCAAAAC CCTGGAAGCT 900 

TCCCATGGAA CACAACCTGT ACATTTGACT GTGAAGAAGG ATTTGAACTA ATGGGAGCCC 960 

AGAGCCTTCA GTGTACCTCA TCTGGGAATT GGGACAACGA GAAGCCAACG TGTAAAGCTG 1020 

TGACATGCAG GGCCGTCCGC CAGCCTCAGA ATGGCTCTGT GAGGTGCAGC CATTCCCCTG 1080 

CTGGAGAGTT CACCTTCAAA TCATCCTGCA ACTTCACCTG TGAGGAAGGC TTCATGTTGC 1140 

AGGGACCAGC CCAGGTTGAA TGCACCACTC AAGGGCAGTG GACACAGCAA ATCCCAGTTT 1200 

GTGAAGCTTT CCAGTGCACA GCCTTGTCCA ACCCCGAGCG AGGCTACATG AATTGTCTTC 1260 

CTAGTGCTTC TGGCAGTTTC CGTTATGGGT CCAGCTGTGA GTTCTCCTGT GAGCAGGGTT 1320 

TTGTGTTGAA GGGATCCAAA AGGCTCCAAT GTGGCCCCAC AGGGGAGTGG GACAACGAGA 1380 

AGCCCACATG TGAAGCTGTG AGATGCGATG CTGTCCACCA GCCCCCGAAG GGTTTGGTGA 1440 

GGTGTGCTCA TTCCCCTATT GGAGAATTCA CCTACAAGTC CTCTTGTGCC TTCAGCTGTG 1500 

AGGAGGGATT TGAATTATAT GGATCAACTC AACTTGAGTG CACATCTCAG GGACAATGGA 1560 

CAGAAGAGGT TCCTTCCTGC CAAGTGGTAA AATGTTCAAG CCTGGCAGTT CCGGGAAAGA 1620 

TCAACATGAG CTGCAGTGGG GAGCCCGTGT TTGGCACTGT GTGCAAGTTC GCCTGTCCTG 1680 

AAGGATGGAC GCTCAATGGC TCTGCAGCTC GGACATGTGG AGCCACAGGA CACTGGTCTG 1740 

GCCTGCTACC TACCTGTGAA GCTCCCACTG AGTCCAACAT TCCCTTGGTA GCTGGACTTT 1800 

CTGCTGCTGG ACTCTCCCTC CTGACATTAG CACCATTTCT CCTCTGGCTT CGGAAATGCT 1860 

TACGGAAAGC AAAGAAATTT GTTCCTGCCA GCAGCTGCCA AAGCCTTGAA TCAGACGGAA 1920 

GCTACCAAAA GCCTTCTTAC ATCCT TTAA G TTCAAAAGAA TCAGAAACAG GTGCATCTGG 1980 

GGAACTAGAG GGATACACTG AAGTTAACAG AGACAGATAA CTCTCCTCGG GTCTCTGGCC 2040 

CTTCTTGCCT ACTATGCCAG ATGCCTTTAT GGCTGAAACC GCAACACCCA TCACCACTTC 2100 

AATAGATCAA AGTCCAGCAG GCAAGGACGG CCTTCAACTG AAAAGACTCA GTGTTCCCTT 2160 

TCCTACTCTC AGGATCAAGA AAGTGTTGGC TAATGAAGGG AAAGGATATT TTCTTCCAAG 2220 

CAAAGGTGAA GAGACCAAGA CTCTGAAATC TCAGAATTCC TTTTCTAACT CTCCCTTGCT 2280 

CGCTGTAAAA TCTTGGCACA GAAACACAAT ATTTTGTGGC TTTCTTTCTT TTGCCCTTCA 2340 

CAGTGTTTCG ACAGCTGATT ACACAGTTGC TGTCATAAGA ATGAATAATA ATTATCCAGA 2400 

GTTTAGAGGA AAAAAATGAC TAAAAATATT ATAACTTAAA AAAATGACAG ATGTTGAATG 2460 

CCCACAGGCA AATGCATGGA GGGTTGTTAA TGGTGCAAAT CCTACTGAAT GCTCTGTGCG 2520 

AGGGTTACTA TGCACAATTT AATCACTTTC ATCCCTATGG GATTCAGTGC TTCTTAAAGA 2580 

GTTCTTAAGG ATTGTGATAT TTTTACTTGC ATTGAATATA TTATAATCTT CCATACTTCT 2640 

TCATTCAATA CAAGTGTGGT AGGGACTTAA AAAACTTGTA AATGCTGTCA ACTATGATAT 2700 

GGTAAAAGTT ACTTATTCTA GATTACCCCC TCATTGTTTA TTAACAAATT ATGTTACATC 2760 

TGTTTTAAAT TTATTTCAAA AAGGGAAACT ATTGTCCCCT AGCAAGGCAT GATGTTAACC 2820 

AGAATAAAGT TCTGAGTGTT TTTACTACAG TTGTTTTTTG AAAACATGGT AGAATTGGAG 2880 

AGTAAAAACT GAATGGAAGG TTTGTATATT GTCAGATATT TTTTCAGAAA TATGTGGTTT 2940 

CCACGATGAA AAACTTCCAT GAGGCCAAAC GTTTTGAACT AATAAAAGCA TAAATGCAAA 3000 

CACACAAAGG TATAATTTTA TGAATGTCTT TGTTGGAAAA GAATACAGAA AGATGGATGT 3060 

GCTTTGCATT CCTACAAAGA TGTTTGTCAG ATGTGATATG TAAACATAAT TCTTGTATAT 3120 

TATGGAAGAT TTTAAATTCA CAATAGAAAC TCACCATGTA AAAGAGTCAT CTGGTAGATT 3180 

TTTAACGAAT GAAGATGTCT AATAGTTATT CCCTATTTGT TTTCTTCTGT ATGTTAGGGT 3240 

GCTCTGGAAG AGAGGAATGC CTGTGTGAGC AAGCATTTAT GTTTATTTAT AAGCAGATTT 3300 

AACAATTCCA AAGGAATCTC CAGTTTTCAG TTGATCACTG GCAATGAAAA ATTCTCAGTC 3360 

AGTAATTGCC AAAGCTGCTC TAGCCTTGAG GAGTGTGAGA ATCAAAACTC TCCTACACTT 3420 

CCATTAACTT AGCATGTGTT GAAAAAAAAA GTTTCAGAGA AGTTCTGGCT GAACACTGGC 3480 

AACGACAAAG CCAACAGTCA AAACAGAGAT GTGATAAGGA TCAGAACAGC AGAGGTTCTT 3540 

TTAAAGGGGC AGAAAAACTC TGGGAAATAA GAGAGAACAA CTACTGTGAT CAGGCTATGT 3600 

ATGGAATACA GTGTTATTTT CTTTGAAATT GTTTAAGTGT TGTAAATATT TATGTAAACT 3660 

GCATTAGAAA TTAGCTGTGT GAAATACCAG TGTGGTTTGT GTTTGAGTTT TATTGAGAAT 3720 

TTTAAATTAT AACTTAAAAT ATTTTATAAT TTTTAAAGTA T AT ATT T ATT TAAGCTTATG 3780 
TCAGACCTAT TTGACATAAC ACTATAAAGG TTGACAATAA ATGTGCTTAT GTTT 

Seq ID NO: 75 Protein sequence: 
Protein Accession #: NP_000441 

1 11 21 31 41 51 

I I I I I I 

MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 60 

SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK 120 

DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGPS GLKCEQIVNC 180 

TALES PEHGS LVCSHPLGNP SYNSSCSISC DRGYLPSSME TMQCMSSGEW SAPIPACNW 240 

ECDAVTNPAN GFVECFQNPG SFPWNTTCTF DCEEGFELMG AQSLQCTSSG NWDNEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 360 

VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 420 

EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ 480 

WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLKGSA ARTCGATGHW 540 

SGLLPTCEAP TESNIPLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 600 
GSYQKPSYIL 



Seq ID NO: 76 Nucleotide sequence: 
Nucleic Acid Accession #: NMJJ31439 

Coding sequence: 69.. 123 5 (underlined seguences correspond to start and stop codons) 



215 



WO 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



CCCGACCCGT 
GTGCGGCCAJ 
CCCTGGACGC 
ACAAGGGCTC 
ACGAGAGGAA 
TGCTGGGAAA 
CGGAGCGGCT 
GGAAGAAGCA 
TCTCCCGGGA 
AGAAGGAGGA 
ACCACGAGGG 
CGTACGGGCT 
CCTTCTTCTC 
CAGGGCACCC 
GCTCCCTGGC 
CCCCATCTCC 
CCCACCTGGG 
TGAGCCAGGT 
CTCCTGGCCA 
CCCAGGTGAC 
CGGCCACGTA 
CCCTCGCGCC 
TTTCCTCCCA 
GCTGGACTCT 
GCCCACATTT 
CTCCCAGTGG 
GACAGACTTG 
ATTAATAAAG 
TGATAATTTT 
TCCAAAGTGA 
GATTTGAGAA 
TATTTTATTT 
ACACACACTT 
TATAAAATTC 
GATGTTTAAA 



11 
I 

GCGAGGGCCA 
GGCTTCGCTG 
CGAGCTGTCG 
CGAGAGCCGT 
ACGGCTGGCA 
GTCGTGGAAG 
GCGCCTGCAG 
GGCCAAGCGG 
CCAGAACGCC 
CAGGGGTGAG 
GCCGGCTGGT 
GCCCACACCT 
CTCCCCCTGC 
GTACTCACCG 
CCTTGGCCAG 
TGCCTATTAC 
CCAGCTTTCC 
GGAACTCCTG 
CCCAGACTCC 
ACCAACGGGT 
CTACAACAGC 
CTCTCCTTCT 
CCGCTCAGGG 
CCTTATCCGA 
TAAGTATATT 
AATGTTCACT 
ATAGCCAAGG 
GAAGATGGGG 
GTGTGCACAG 
CCACAAAATT 
ATTAACCAGT 
TAAATATACA 
CAAGAGCCAC 
AGTGTATTAG 
AACAAAACAG 



21 

I 

GGTCCGCGCC 
CTGGGAGCCT 
GATGGACAAT 
ATCCGGCGGC 
GTGCAGAACC 
GCGCTGACGC 
CACATGCAGG 
CTGTGCAAGC 
CTGCCGGAGA 
TACTCCCCCG 
GGTGGCGGCG 
CCTGAAATGT 
CAGGAGGAGC 
GAGTACGCCC 
TCCCCCGGCG 
TCCCCGGCCA 
CCGCCTCCTG 
GGGGACATGG 
GCCACAGGGG 
CCCACAGAGA 
TACAGTGTGT 
TGTGCCTTGA 
CAGGGAGGTC 
GTGCCGCCTC 
CCTTCAAGTG 
GACGTCTTTT 
TCCCTTCTGG 
AAATTTGACT, 
CCCAAGGACC 
TCAAAGGGAC 
ATGGCTAACT 
TTTTAAAGCA 
CGCGCCCAGC 
TTTCATTACA 
GCTGTTGTAA 



31 
I 

TGCCCCGCCA 
ACCCTTGGCC 
CGCCGCCGGC 
CCATGAACGC 
CGGACCTGCA 
TGTCCCAGAA 
ACTACCCCAA 
GCGTGGACCC 
AGAGAAGCGG 
GCACTGCCCT 
GCGGCACCCC 
CTCCCCTGGA 
ATGGCCATCC 
CAAGCCCTCT 
TCTCCATGAT 
CCTACCACCC 
AGCACCCTGG 
ATCGCAATGA 
CCATGGCCCT 
CCAGCCTCAT 
CATAGAGCTG 
GTGGCAGAGG 
TGAACTGCGG 
TATCCCCTTC 
AGTTTTCCTC 
CTTGGTAGCC 
TCCAGTTTTC 
CATTAATGAG 
ACGAGGCTTT 
TCATACAATT 
ATATCACAGA 
GTTCTTTTTT 
CTACATTTAT 
TAGGAGAAAT 
AAAAAAAAAA 



41 
I 

GGCGAAGCGA 
CGAGGGTCTC 
CGTCCCCCGG 
CTTCATGGTT 
CAACGCCGAG 
GAGGCCGTAC 
CTACAAGTAC 
GGGCTTCCTT 
CAGCCGGGGG 
GCCCAGCCTC 
GAGCAGTGTG 
CGTGCTGGAG 
CCGCCGCATC 
CCACTGTAGC 
GTCCCCTGTA 
ACTCCACTCC 
CTTCGACGCC 
ATTCGACCAG 
CAGTGGGCAT 

GAGGCGCCCC 
AGCCGTCCAG 
CCCCAGAGCC 
CCCACGTTCC 
CAGCCCCTGA 
ATCATCGAAA 
TGATTTAGGG 
CTCGCTAACC 
CTGCACTTTC 
TGAGAAAAAA 
AAATGGGATT 
TTTGTTAATT 
AATTTTCATT 
TATATTTCTA 
AAAAAAAAA 



51 
I 

GGCGACCCGC 
GAGTGCCCGG 
CCCCCGGGGG 
TGGGCCAAGG 
CTCAGCAAGA 
GTGGACGAGG 
CGGCCGCGCA 
CTGAGCTCCC 
GCGCTGGGGG 
CGGGGCTGCT 
GACACGTACC 
CCGGAGCAGA 
CCCCACCTGC 
CACCCCCTGG 
CCCGGCTGTC 
AACCTCCAAG 
CTGGATCAAC 
TATTTGAACA 
GTTCCGGTCT 
GCTGATGCCA 
GTCCGGTCAG 
CCACACCAGC 
TTTGGCCTAA 
AGCCCCTGCA 
GAGTTGCTGT 
CTAATGGGGG 
TTCTCTCAAG 
TACGATCTGG 
TGCACCCCCT 
CAGTCAACCT 
GAGTTAAAAC 
TGTTTATTAT 
CTCTTTTACC 
AACATTTTAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620. 
1680 
1740 
1600 
1660 
1920 
1980 
2040 



45 



50 



55 



60 



65 



70 



75 



Seq ID NO: 77 Protein sequence: 
Protein Accession #: NP 113627 



1 
I 

MASLLGAYPW 
KRLAVQNPDL 
QAKRLCKRVD 
GPAGGGGGGT 
PYSPEYAPSP 
GOLSPPPEHP 
TPTGPTETSL 



11 
I 

PEGLECPALD 
HNAELSKMLG 
PGFLLSSLSR 
PSSVDTYPYG 
LHCSHPLGSL 
GFDALDQLSQ 
ISVLADATAT 



21 
I 

AELSDGQSPP 
KSWKALTLSQ 
DQNALPEKRS 
LPTPPEMSPL 
ALGQSPGVSM 
VELLGDMDRN 
YYNSYSVS 



31 
I 

AVPRPPGDKG 
KRPYVDEAER 
GSRGALGEKE 
DVLEPEQTPF 
MSPVPGCPPS 
EFDQYLNTPG 



41 
I 

SESRIRRPMN 
LRLQHMQDYP 
DRGEYSPGTA 
SSPCOEEHGH 
PAYYSPATYH 
HPDSATGAMA 



51 
I 

AFMVWAKDER 60 

NYKYRPRRKK 120 

LPSLRGCYHE 180 

PRRIPHLPGH 240 

PLHSNLQAHL 300 

LSGHVPVSQV 360 



Seq ID NO: 78 Nucleotide sequence: 
Nucleic Acid Accession #: XM_035787 

Coding sequence: 329.. 949 (underlined sequences correspond to start and stop codons) 



1 
I 

TGCCCCGCCC 
AGGAGGAGGG 
CCTCCTCCTC 
CCGCTCCCGG 
GCAGCCGCGG 
AATCCTTTGG 
TTTGGCTTTA 
TCTACAGTTC 
TCGGAAGAAG 
GCAGGAGGAC 
TGTTTTCTCT 
TTTAAGAGTA 
AGAAGATAAA 
TAACTACGTG 



11 

I 

CGCTCCCCAG 
TGGATCTCCC 
CTCCTCCAGC 
CGCGGGGCCT 
CTTCCGGAGC 
TGAAAACTGA 
CACAAAGTCA 
ATGTACGATG 
GTAGTGCTAG 
TACGCTGCAA 
ATTACAGAAA 
AAAGAAGATG 
AGACAGGTTT 
GAAACATCTG 



21 
I 

CGCCCCGGAA 
CAGAGCAAAG 
CGCCCAGGCT 
TCCAGGCGAC 
CCTCGGGGCG 
GACACAAAAI 
TCATGGTGGG 
AGTTTGTGGA 
ATGGGGAGGA 
TTAGAGACAA 
TGGAATCCTT 
AGAATGTTCC 
CTGTAGAAGA 
CTAAAACACG 



31 
I 

GTGATCTGTG 
CGTCGGAGTC 
CCCCCGCCAC 
AAGGACCGAG 
GCGGACTGGC 
GGCTGCAAAT 
CAGTGGTGGC 
GGACTATGAG 
AGTCCAGATC 
CTACTTCCGA 
TGCAGCTACA 
ATTTCTACTG 
GGCAAAAAAC 
AGCTAATGTT 



41 

I 

GCGGCTGCTG 
CTCCTCCTCC 
CCGTCAGACT 
TACCCTCCGG 
TCGCGGTGCA 
AAGCCCAAGG 
GTGGGCAAGT 
CCTACCAAAG 
GATATCTTAG 
AGTGGGGAGG 
GCTGACTTCA 
GTTGGTAACA 
AGAGCTGAGC 
GACAAGGTAT 



51 
I 

CAGAGCCGCC 
TTCTCCTCCT 
CCTCCTTCGA 
CCGGAGCCAC 
GATTCTTCTT 
GTCAGAATTC 
CAGCTCTGAC 
CAGACAGCTA 
ATACAGCTGG 
GGTTCCTCTG 
GGGAGCAGAT 
AATCAGATTT 
AGTGGAATGT 
TTTTTGATTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
040 



216 



WO 02/079492 



PCT/US02/04915 



AATGAGAGAA ATTCGAGCGA GAAAGATGGA AGACAGCAAA GAAAAGAATG GAAAAAAGAA 900 

GAGGAAAAGT TTAGCCAAGA GAATCAGAGA AAGATGCTGC ATTTTATAAT CAAAGCCCAA 960 

ACTCCTTTCT TATCTTGACC ATACTAATAA ATATAATTTA TAAGCATTGC CATTGAAGGC 1020 

TTAATTGACT GAAATTACTT TAACATTTTG GAAATTGTTG TATATCACTA AAAGCATGAA 1080 

TTGGAACTGC AATGAAAGTC AAATTTACTT TAAAAAGAAA TTAATATGGC TTCACCAAGA 1140 

AGCAAAGTTC AACTTATTTC ATAATTGCCT ACATTTATCA TGGTCCTGAA TGTAGCGTGT 1200 

AAGCTTGTGT TTCTTGGGCA GTCTTTCTTG AAATTGAAGA GGTGAAATGG GGGTGGGGAG 1260 

TGGGAGGAAA GGTGACTTCC TCTGGTGTTT ATTATAAAGC TTAAATTTTA TATCATTTTA 1320 

AAATGTCTTG GTCTTCTACT GCCTTGAAAA ATGACAATTG TGAACATGAT AGTTAAACTA 1380 

CCACTTTTTT TAACCATTAT TATGCAAAAT TTAGAAGAAA AGTTATTGGC ATGGTTGTTG 1440 

CATATAGTTA AACTGAGAGT AATTCATCTG TGAATCTGCT TTAATTACCT GGTGAGTAAC 1500 

TTAGAAAAGT GGTGTAAACT TGTACATGGA ATTTTTTGAA TATGCCTTAA TTTAGAAACT 1560 

GAAAAATATC TGGTTATATC ATTCTGGGTG TGTTCTTACT GACACCAGGG GTCCGCTGCC 1620 

CCATGTGTCC TGGTGAGAAA ATATATGCCT GGCACAGCTT TTGTATAGAA AATTCTTGAG 1680 

AAGTAACTGT CCGCTAGAAG TCTGTCCAAA TTTAAAATGT GTGCCATATT CTGGTTCTTG 1740 

AAAATAAGAT TCCAGAGCTC TTTGATCGCT TTTAATAAAC TGCAAGTTCA TTTTAAATGA 1800 

AGGGCCAGCA TATATACTTG CAAGATAATT TTCAGCTGCA AGGATTCAGC ACCAGTTATG 1860 

TTTGAATGAA CCCTCCTTTT CTCTGAGATT CTGGTCCCTG GAAATCCCTT TCTGCTAGTG 1920 

GTGAGCATGT AAGTGTTAAG TTTTTAATCT GGGAGCAGGG CATAGGAAGA AAATGTCAGT 1980 

AGTGCTAATG CATTTTGCAC TAGAACGCTT CGGGAAAATA TTCATGCTTG CCATCTGTTC 2040 

ATTTCTAAAT TTATATTCAT AAAGTTACAG TTTGATACAG GAATTATTAG GAGTAATTCT 2100 

TTTCTGTTTC TGTTTATAAT GAAGAACACT GTAGCTACAT TTTCAGAAGT TAACATCAAG 2160 

CCATCAAACC TGGGTATAGT GCAGAAAACG TGGCACACAC TGACCACACA TTAGGCTGTG 2220 

TCACCATTGT GTGGTGTACC TGCTGGAAGA ATTCTAGCAT GCTACTTGGG GACATAATTT 2280 

CAGTGGGAAA TATGCCACTG ACCGATTTTT TTTTTTTCCT CTTTGCAGTG GGGCTAGGAC 2340 

AGTTGATTCA ACAAAGTATT TTTTTCTTTT TTCTCAGTCC TAATTTGAAC AGGTCAAAGA 2400 

TGTGTTCAGG CATTCCAGGT AACAGGTGTG TATGTAAAGT TAAAAATAGG CTTTTTAGGA 2460 

ACTCACTCTT TAGATATTTA CATCCAGCTT CTCATGTTAA ATATTTGTCC TTAAAGGGTT 2520 

TGAGATGTAC ATCTTTCATT TCGTATTTCT CATAGGCTAT GCCATGTGCG GAATTCAAGT 2580 

TACCAATGTA ACACTGGCCA GCGGGCCCAG CAATCTCCAT GTGTACTTAT TACAGTCTTA 2640 

TTTAACCAGG GGTCCTAACC ACTAACATTG TGACTTTGCT TTGAGACCTT TCCTCTCCTG 2700 

GGTACTGAGG TGCTATGAAG CCAACTGACA AAGATGCATC ACGTGTCTTA GGCTGATGCC 2760 
ACTACCCGAT TTGTTTATTT GCAATTTGAG CCATTTAAAG ACCAATAAAC TTCCTTTTTT 

Seq ID NO: 79 Protein sequence: 
Protein Accession #: XP_035787 

1 11 21 31 41 51 

I I I I 1 I 

MAANKPKGQN SLALHKVIMV GSGGVGKSAL TLQFMYDEFV EDYEPTKADS YRKKWLDGE 60 

EVQIDILDTA GQEDYAAIRD NYFRSGEGFL CVFSITEMES FAATADFREQ ILRVKEDENV 120 

PFLLVGNKSD LEDKRQVSVE EAKNRAEQWN VNYVETSAKT RANVDKVFFD LMREIRARKM 180 
EDSKEKNGKK KRKSLAKRIR ERCCIL 



Seq ID NO: 80 Nucleotide sequence: 
Nucleic Acid Accession #: NM_003467 

Coding sequence: 89.. 1147 (underlined sequences correspond to start and stop codons) 



GTTTGTTGGC 
CACCGCATCT 
CTACACCGAG 
AGAAAATGCT 
TGGCATTGTG 
CATGACGGAC 
TCCCTTCTGG 
AGTCCATGTC 
TCTGGACCGC 
GGCTGAAAAG 
CTTCATCTTT 
CAATGACTTG 
TGGTATTGTC 
CCACCAGAAG 
TTGGCTGCCT 
GCAAGGGTGT 
TTTCTTCCAC 
CTCTGCCCAG 
AGGAAAGCGA 
CAGCTAACAC 
ACATTTTTCA 
TTGTCTTGTG 
TGTTTCATAT 
CTCGTGGTAG 
AAGCTAGAAA 



11 
I 

TGCGGCAGCA 
GGAGAACCAG 
GAAATGGGCT 
AATTTCAATA 
GGCAATGGAT 
AAGTACAGGC 
GCAGTTGATG 
ATCTACACAG 
TACCTGGCCA 
GTGGTCTATG 
GCCAACGTCA 
TGGGTGGTTG 
ATCCTGTCCT 
CGCAAGGCCC 
TACTACATTG 
GAGTTTGAGA 
TGTTGTCTGA 
CACGCACTCA 
GGTGGACATT 
AGATGTAAAA 
GATATAAAAG 
TTTCTTTAGT 
TGATGTGTGT 
GACTGTAGAA 
TGATCCCCAG 



21 
I 

GGTAGCAAAG 
CGGTTACCAT 
CAGGGGACTA 
AAATCTTCCT 
TGGTCATCCT 
TGCACCTGTC 
CCGTGGCAAA 
TCAACCTCTA 
TCGTCCACGC 
TTGGCGTCTG 
GTGAGGCAGA 
TGTTCCAGTT 
GCTATTGCAT 
TCAAGACCAC 
GGATCAGCAT 
ACACTGTGCA 
ACCCCATCCT 
CCTCTGTGAG 
CATCTGTTTC 
GACTTTTTTT 
ACTGACCAAT 
TTTTGTGAAG 
CTAGGCAGGA 
AAGGGAACTG 
CTGTTTATGC 



31 
I 

TGACGCCGAG 
GGAGGGGATC 
TGACTCCATG 
GCCCACCATC 
GGTCATGGGT 
AGTGGCCGAC 
CTGGTACTTT 
CAGCAGTGTC 
CACCAACAGT 
GATCCCTGCC 
TGACAGATAT 
TCAGCACATC 
TATCATCTCC 
AGTCATCCTC 
CGACTCCTTC 
CAAGTGGATT 
CTATGCTTTC 
CAGAGGGTCC 
CACTGAGTCT 
TATACGATAA 
ATTGTACAGT 
TTTAATTGAC 
CCTGTGGCCA 
AACATTCCAG 
ATAGATAATC 



41 



I 



GGCCTGAGTG 
AGTATATACA 
AAGGAACCCT 
TACTCCATCA 
TACCAGAAGA 
CTCCTCTTTG 
GGGAACTTCC 
CTCATCCTGG 
CAGAGGCCAA 
CTCCTGCTGA 
ATCTGTGACC 
ATGGTTGGCC 
AAGCTGTCAC 
ATCCTGGCTT 
ATCCTCCTGG 
TCCATCACCG 
CTTGGAGCCA 
AGCCTCAAGA 
GAGTCTTCAA 
ATAACTTTTT 
TTTTATTGCT 
TTATTTATAT 
AGTTCTTAGT 
AGCGTGTAGT 
TCTCCATTCC 



51 
I 

CTCCAGTAGC 
CTTCAGATAA 
GTTTCCGTGA 
TCTTCTTAAC 
AACTGAGAAG 
TCATCACGCT 
TATGCAAGGC 
CCTTCATCAG 
GGAAGCTGTT 
CTATTCCCGA 
GCTTCTACCC 
TTATCCTGCC 
ACTCCAAGGG 
TCTTCGCCTG 
AAATCATCAA 
AGGCCCTAGC 
AATTTAAAAC 
TCCTCTCCAA 
GTTTTCACTC 
TTTAAGTTAC 
TGTTGGATTT 
AAATTTTTTT 
TGCTGTATGT 
GAATCACGTA 
CGTGGAACGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



217 



WO 02/079492 



PCT7US02/04915 



TTTTCCTGTT CTTAAGACGT GATTTTGCTG TAGAAGATGG CACTTATAAC CAAAGCCCAA 1560 
AGTGGTATAG AAATGCTGGT TTTTCAGTTT TCAGGAGTGG GTTGATTTCA GCACCTACAG 1620 
TGTACAGTCT TGTATTAAGT TGTTAATAAA AGTACATGTT AAACTTACTT AGTGTTATG 

Seq ID NO: 81 Protein sequence: 
Protein Accession #: NP_003458 



MEGISIYTSD 
LVMGYQKKIaR 
YSSVLILAFI 
DDRYICDRFY 
TVILILAFFA 
LYAFLGAKFK 



11 
I 

NYTEEMGSGD 
SMTDKYRLHL 
SLDRYIiAIVH 
PNDLWWVFQ 
CWLPYYIGIS 
TSAQHALTSV 



21 
! 

YDSMKEPCFR 
SVADLLFVIT 
ATNSQRPRKL 
FQHIMVGLIL 
IDSFILLEII 
SRGSSLKILS 



31 
I 

EENANFNKIF 
LPFWAVDAVA 
LAEKWYVGV 
PGIVILSCYC 
KQGCEFENTV 
KGKRGGHSSV 



41 
I 

LPTTYSIIFL 
NWYFGNFLCK 
WIPALLLTIP 
IIISKLSHSK 
HKWISITEAL 
STESESSSFH 



51 
I 

TGIVGNGLVX 
AVHVIYTVNL 
DFIFANVSEA 
GHQKRKALKT 
AFFHCCLNPI 
SS 



60 
120 
180 
240 
300 



Seq ID NO: 82 Nucleotide sequence: 
Nucleic Acid Accession #: NM_014959 

Coding sequence: 314.. 1609 (underlined sequences correspond to start and stop codons) 



CTGGTTCTCA 
GGGAAACAAG 
GTGCAAGCAT 
ATTTGACAAA 
CCGAGACGGG 
TTTTTCCAAG 
TGTCAACTAT 
TTCCTCTTAT 
TCTGGGGCCT 
CGTTTGGTTC 
AAGGGATGAG 
CCTGCAGCAC 
AGAGGAGGCT 
CGTCTCCTGG 
CCGGGTGGAG 
GCTGCGGATC 
TTATCACCCC 
GCTAACAAAG 
GCCCCCAATG 
GAAAGTAATG 
CTCAAAATTC 
ACATGGGACT 
ATCAGCCCCT 
AGCCAGGATG 
TGAGAATGAG 
GCTGAGCATG 
TGAAAGGGAC 
TAGGTAGTCT 
TCATTGATTT 
AGGACAGACT 
AATGTCTGAA 
CACTATTTTA 
' GTGTTTATGA 
CCTATGGACA 
AAGCTTCTCA 
GTTAGGACTT 
TGAAAGAGAG 
GTGTGTAGCC 
CTGAGATGAA 
CAGCTTATTC 
ACATTTGGAT 
ATGTTTTCGT 
TAGTAGACAC 
AGTCTCTTTT 
ATTTATCCTC 
CCACTTAAAG 
TAGGATATAG 
ATATATTTTC 
TTATGTTTTT 
GTTCTGCAAA 
TAGGTACAGA 
TCAGTGATGT 



11 
I 

ACTTCTTTTG 
CTCTCAGGAC 
CTGGGCCATC 
TGGAAAAAAA 
TATACAGGGA 
AAGATGATGA 
CTGGGGGGGA 
GCTTCTAAAG 
GAAGGAAATG 
CCCACTGCTG 
GTCACAGTGA 
CATGAACAGT 
GTCGCCGAAA 
TTTCTCGTTG 
CCTTTCTATG 
GCCAGTGGGA 
CACCCCGAAG 
GCGATAGATG 
GAACCCCTGA 
CCCAAGGAGT 
TATGCTGGGC 
TTGGTGTGGG 
CCTCCTTTCT 
GGGGACCTGA 
AAGGAGCTGG 
GTGGAGAAGA 
CCTTACCTCG 
GGAAGAGAGA 
CAGTGTTCAA 
TGTAACCTGG 
GAAGGTAGTA 
TCCATTGACA 
AGGATGGGGC 
CTCCGTTTGA 
GCTGAGGACA 
TAACACTTTA 
CACAAAAATG 
TGCTACAAGG 
ATGTGGTAAA 
TGAGACAGAC 
TCTACCATAG 
GGCACACATA 
CATCTGGTTG 
AACTGGAAGA 
AGTCAGCCAG 
TGGAATCTAG 
TTATCTTCTA 
TACTTTTAAT 
CCAGATATTT 
TTTCCACTTC 
TTCTCTTATT 
TTTTTGCTTG 



21 
I 

AAATAATGTT 
TTCCGGTCGC 
TTCAATGGTA 
GGAGTGTCCA 
GCTACCCTGT 
GACAGAGGCA 
CATTCCCAGG 
TCTGTTTTGA 
TGGATGTTGA 
GCTGGTATCT 
CGATTGCGTT 
GGCTGGTGGG 
TCCACCTCCC 
CCCATTTTAA 
CTGTCCTGGA 
CTCGCCTCTC 
ATATTAAGTT 
ATGAGGAAGA 
ACTTTGGTTC 
TGAAATTGTC 
AGATGAAGGA 
ATACTGAGGT 
CAGGTGCAGC 
AAGGGGTGCT 
TGGAGCAGGA 
AAGGGGACCT 
TGTCCTATCT 
ATCCAGCGTT 
GACAGAAGAA 
CATGTACCTA 
ATATTCCTTT 
TGATTCTTGA 
CTGGAAAGGC 
AGTATCACCT 
CTCAAGGCAT 
TCTATGGCTA 
GGAGAAAATG 
AGTTGTTGGG 
TCAACTCCAC 
ATTCCTGGCA 
ACTCTGTCAT 
TTTCCATCCT 
AGTCAGTTTT 
TTTCAATTAC 
TTTGTTATGT 
GCACTTTATC 
CATAATCTTT 
CACTCAGAAG 
ACCATTTCTG 
TTCTGATAGA 
TTTTGCTTCC 
TAGTATTTTT 



31 
I 

CATAGAGAAG 
CATGATGGCT 
AAAAAGATAC 
GAAAAGAGTA 
GTTTCTGAGA 
GAGCCATTAT 
AGACATTTGC 
GATCGAAGAA 
GTTGATTGAT 
GTGGTCAGCC 
TGGTTCCTGG 
CGGCCCCTTG 
CCACTTCATC 
GAATGAAGGG 
AAGCCCCAGC 
CATCCCCATC 
CCACTTGTAC 
TCGCTTCCAT 
CAGTTATATT 
CTACAGGAGC 
ACCCATTCAA 
GAAGCCAGTG 
CTTTGTGAAG 
CGATGATCTC 
AAAGACACGG 
GGCCCTGGAC 
TAGACAGCAG 
CTCATTGGAA 
GACTGGGTAA 
TTGACTGTAT 
TAAATTTTTT 
AGACCCAGGA 
AACTTTTCCT 
TCTCATAACT 
ACATGATGAC 
CTGTTATTAG 
CAAACATGAG 
TTAAATGTTC 
AGAACCACCA 
ATGTACCATA 
TTTGTAGCCA 
TTTATGTTTA 
TTTTATGGTG 
TTACGTTAAT 
CTTTTCTATT 
ACCATTTAGA 
CTGTATCTTA 
ATTTAAAAAA 
TTGCTCTTCC 
CGTTTTTTAG 
TCTGAGGACA 
AGTTGACATT 



41 
I 

GAGGGCTGTC 
GTGGGCGGTA 
AGTAAAGACA 
GCAGCAGTGA 
CCCTTTGTGA 
TGTTCCGTGC 
TCAGAAGAGA 
GATTATAAAA 
AAGAGCACAA 
ACAGGCCTCG 
AGTCAGCACC 
TTTGATGTCA 
TCCCTCCAAG 
ATGGTCCTGG 
TTCTCTCTGA 
ACTTCCAACA 
CTTGTCCCCA 
GGTGTGCGCC 
GTGTCTAATT 
CCTGGAGAAA 
CTTGAGATTA 
GATCTCCAGC 
GAGAACCACC 
CAGGACAATG 
CAGAGCAAGA 
GTGCTCTTCA 
AATTTG TAAA 
ATGGATAAAC 
CATCTATCAC 
CCTCATGCAT 
CCAACCATTG 
TAAAGGACAT 
GATTAATGTG 
AAAAGCAGAA 
AGTCTTTTTT 
AACAATGTAA 
CAGAAAATAT 
ATGGTCAACT 
AAAAGAAAAT 
CAAAAAATAA 
TTTCAGCTGT 
ATCTGTTTAA 
TATTTTGAAC 
GTAATTATTA 
CTACTGTTAT 
TCCTATTACC 
AAACCCATCA 
CTCATGAGAA 
TTCATTATTT 
TTCTTTTAGA 
TCTTTTTCTC 
GTTTTCTGTT 



51 



TGAGATTCGA 


60 


AACGCGGTTA 


120 


TAAATACCAC 


180 


GGAAGAGCTG 


240 


CATCTCACAT 


300 


TGTTCCTGAG 


360 


ATCAAATAGT 


420 


ATCGTCAGTT 


480 


ACAGATACAG 


540 


GCTTCCTGGT 


600 


TGGCCCTGGA 


660 


CTGCAGAGCC 


720 


GTGAGGTGGA 


780 


AGCATCCAGC 


840 


TGGGCATCCT 


900 


CATTGATCTA 


960 


GCGACGCCTT 


1020 


TGCAGACTTC 


1080 


CTGCTAACCT 


1140 


TTCAGCACTT 


1200 


CTGAAAAAAG 


1260 


TTGTAGCTGC 


1320 


GGCAACTCCA 


1380 


AGGTTCTTAC 


1440 


ATGAGGCCTT 


1500 


GAAGCATTAG 


1560 


ATGAGTCAGT 


1620 


AGAAATGTGA 


1680 


ACAGGCTTTC 


1740 


TTTCCTCAAG 


1800 


CTTGATATAT 


1860 


CCGGATAGGT 


1920 


AAAAATAATT 


1980 


AAGCTAACAA 


2040 


TTTTTTGTAT 


2100 


ATGTATTTGC 


2160 


TTTCCCACTG 


2220 


CCAAGGAATA 


2280 


GAGGGTAATT 


2340 


GCCAACTCTG 


2400 


CTTTTGATTA 


2460 


AACAAGTTCC 


2520 


CCATTCTGAT 


2580 


ATATGTTAGG 


2640 


CACATTTGTA 


2700 


TTTTCTCATC 


2760 


ATAAATTATT 


2820 


GAGTAATCTG 


2880 


TCCAAATTTC 


2940 


GTGGTTCTGA 


3000 


ACCTTCATTC 


3060 


CAGCAGTTTC 


3120 
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CTTTTAGCTT CCGTATTTCC TGATGAGAAA TCTGCAGTCA TTCAAATTGT TGTTTCCCTG 3180 

TATGTAGTGT GTCATTTTTC TGTCAGATTT CAAGGTATTT ATCTTTAGTT TTTAGCCATT 3240 

TCATTATGTT GGGGATGAGT TTCCTTGTTT TATTCCCTTT GGAATTTGCT CCAATTCATA 3300 

AATTTGCAGT TTTATGTCTT TTACCAAACT TAGAGGTTTT CAGCCTAATT TCTAAAAATA 3360 

CTTTTTATTA GCCTGATTTT CATCTTTATA GGAAATAGTT TAAGTGATGA CAAGTTCCAA 3420 

TAGCTTATAT GCCCAGAAGG CCTTCAAAAT AAGAATTTTG AAAGAATACA GAAAACAAAC 34 80 

TTTTATATCC TTCTCATGTC TTCTACTGTA AAATTCATAT GCTTTGCTAC TCTAAACCTA 3540 

GTTTGAAATC AACAGTCTTG AGAATAGATG AAAATTTTGA TGAATAGTGG AATTCTTTTA 3600 

AATGGAAACC TCTTACATGT GATTTTCCTT GCCATCTAGA AATAAACCAT AGTATTTATG 3660 

TTGAATCAAT CAATATTATA TTTTGTTTTT TTCCTCCTCT TCTGAGACTC TTATTGTGGA 3720 

AATGTTAGAC TTTTATGTTT TCCTAAATGT CCCTGATATT CTACTTATTT AGAACATCTT 3780 

TTCATTTTTT CCATTATTCT GATTGGGTAA TTTTAATTTG TCTATTTTCA AATTTGCTGG 3840 

AGTGTTCACC TGTTGTTGTC TGTGTCGTCC CACTGAGTGC ATTCACCACC TTTTAAATTT 3900 

TGGTCACTGT ATGTATCAGT TCTAAAATTT CCATTTTGTT CTCTATATTT TAAATTTCTT 3960 

GGCTTATATT CTATTTTCCT GCAAATGTGT CAGCATTTGC TTGTTTGAGC TTTTTTTTTT 4020 

TCAAGACAGG GTCTCAACTC TGTTACCCAG GCTGGAGTGC AGTGGTGCGA TCTCAGCTCA 4080 

CTGCAACCTC TGCCTCCTGG TTCAAGCGAT TATTGTGCCT CAGCCTCCTG AGTAGCTGGG 4140 

ATTACAGGCA TGCACCACCA CAGCCCAGCT AATTTTTTGT ATTTTTAGTA GAGACAGAGT 4200 

TTTGCTATGT TGGCCAGGCT GGTTTTGAAC TCCTGGCCTC AAGTGATCCA CCCACCTCAG 4260 

CCTCCCAAAG TGCTGGGATT ACAGG CCACT ACACCTGGCA CATTTGAGTA TTTTTTTTTT 4320 

TTTTTTTTTT TTGAGATGGA GTCTCGCTCT GTCATCTAGG CTGGAGTGCA GTGGTGTGAT 4380 

CTCAGCTCAC TGCAGCCTCT GTCTCCCGGG CTCAAGCGAT TCTCTTGCCT CAGCCTCCTG 4440 

AGTAGCTAGG ACTACAGGTG CATGCCAACA CGCCCGGCTA ATTTTTTTAA AAAATATTTT 4500 

TAGTAGAGAC AGGGTTTCAC CATTTTGGCC AGGATGGTCT CGATCTCCTG ACCTCATGAT 4560 

CCACCCGCCT CGGCCTTCCA AAGTGCTGGG ATTACAGGCA TGAGCCACCG TGCCTGGCCT 4620 

CATTTGAGTA TTTTTATAAT GTCTCTTTTA AAGTCTTTGT CAGATAATTC CACTGTACAT 4680 

GTTATTCAGT GTTTGGTGTC CACTGAGTTG TCATTTGCCA GACAAGTGGA GATTTTTGCA 4740 

GCTCATCCTT GTATTCTCAG TAGTTCCGAT ATGTACCCTC GACATGTGAA TGTTATCTTA 4800 

TGAGACTCTG TTTTATTTGT ATCCAACAGA AGATGTTTAT TATTTATTTG GCTTTCTGTG 4860 

AACTGAGGTC TTAATATCAG CTCATTTTAA AAGTCTTTGC AGTGGTATTC GGATCTATCC 4920 

TGTGTGTGCC TATGAGATTG GGTGCAGTGT ATCCTGTTAG CTCCATTCTC AGGGCGTTTG 4980 

AATGTGAATT AGGACCAGCG CAATGAATGC TCAAGTTGGG GTTGGGCGTT AGAATTCATA 5040 
AAAGTCTTTA TATGCTCAG 



Seq ID NO: 83 Protein 3eouence: 
Protein Accession #: NP_055774 



1 11 21 31 41 51 

I I I I I I 

MMRQRQSHYC SVLFLSVNYL GGTPPGDICS EENQIVSSYA SKVCFEIEED YKNRQFLGPE 60 

GNVDVELIDK STNRYSVWFP TAGWYLWSAT GLGFLVRDEV TVTIAFGSWS QHLALDLQHH 120 

EQWLVGGPLF DVTAEPEEAV ABIHLPHFIS LQGEVDVSWF LVAHFKNEGM VLEHPARVEP 180 

FYAVLESPSF SLMGILLRIA SGTRLSIPIT SNTLIYYHPH PEDIKFHLYL VPSDALLTKA 240 

IDDEEDRFHG VRLQTSPPME PLNFGSSYIV SNSANLKVMP KELKLSYRSP GEIQHFSKFY 300 

AGQMKEPIQL EITEKRHGTL VWDTEVKPVD LQLVAASAPP PFSGAAFVKE NHRQLQARMG 360 

DLKGVLDDLQ DNEVLTENEK ELVEQEKTRQ SKNEALLSMV EKKGDLALDV LFRSISERDP 420 
YLVSYliRQQN L 



Seq ID NO: 84 Nucleotide sequence: 
Nucleic Acid Accession #: NM_007036 

Coding sequence: 56-610 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CTTCCCACCA GCAAAGACCA CGACTGGAGA GCCGAGCCGG AGGCAGCTGG GAAACATGAA 60 

GAGCGTCTTG CTGCTGACCA CGCTCCTCGT GCCTGCACAC CTGGTGGCCG CCTGGAGCAA 120 

TAATTATGCG GTGGACTGCC CTCAACACTG TGACAGCAGT GAGTGCAAAA GCAGCCCGCG 180 

CTGCAAGAGG ACAGTGCTCG ACGACTGTGG CTGCTGCCGA GTGTGCGCTG CAGGGCGGGG 240 

AGAAACTTGC TACCGCACAG TCTCAGGCAT GGATGGCATG AAGTGTGGCC CGGGGCTGAG 300 

GTGTCAGCCT TCTAATGGGG AGGATCCTTT TGGTGAAGAG TTTGGTATCT GCAAAGACTG 360 

TCCCTACGGC ACCTTCGGGA TGGATTGCAG AGAGACCTGC AACTGCCAGT CAGGCATCTG 420 

TGACAGGGGG ACGGGAAAAT GCCTGAAATT CCCCTTCTTC CAATATTCAG TAACCAAGTC 480 

TTCCAACAGA TTTGTTTCTC TCACGGAGCA TGACATGGCA TCTGGAGATG GCAATATTGT 540 

GAGAGAAGAA GTTGTGAAAG AGAATGCTGC CGGGTCTCCC GTAATGAGGA AATGGTTAAA 600 

TCCACGCTGA TCCCGGCTGT GATTTCTGAG AGAAGGCTCT ATTTTCGTGA TTGTTCAACA 660 

CACAGCCAAC ATTTTAGGAA CTTTCTAGAT ATAGCATAAG TACATGTAAT TTTTGAAGAT 720 

CCAAATTGTG ATGCATGGTG GATCCAGAAA ACAAAAAGTA GGATACTTAC AATCCATAAC 780 

ATCCATATGA CTGAACACTT GTATGTGTTT GTTAAATATT CGAATGCATG TAGATTTGTT 840 

AAATGTGTGT GTATAGTAAC ACTGAAGAAC TAAAAATGCA ATTTAGGTAA TCTTACATGG 900 

AGACAGGTCA ACCAAAGAGG GAGCTAGGCA AAGCTGAAGA CCGCAGTGAG TCAAATTAGT 960 

TCTTTGACTT TGATGTACAT TAATGTTGGG ATATGGAATG AAGACTTAAG AGCAGGAGAA 1020 

GATGGGGAGG GGGTGGGAGT GGGAAATAAA ATATTTAGCC CTTCCTTGGT AGGTAGCTTC 1080 

TCTAGAATTT AATTGTGCTT TTTTTTTTTT TTTGGCTTTG GGAAAAGTCA AAATAAAACA 1140 
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10 



15 



ACCAGAAAAC 
AGCTTTGAAC 
TGAAGGACGG 
CCACCTCAGA 
GTAAATATTT 
ATTTATCATC 
AACCTATGAC 
TATAGGAGTC 
TAAACATAAG 
GGAGGTTTGT 
TGGAATTAGG 
CTTAGGAAAT 
AGTATTTACC 
GCCTTTGAAT 
TTGTTCAATA 



CCCTGAAGGA 
TGAGAGCAAT 
TTCTGGGGCA 
GATAAATCTA 
ATATATTTTT 
CTCTTGAGGA 
TCTATAAGGT 
ACTCTGGATT 
TGCTGTGACT 
AAAAGAAGAA 
AGTATATTTG 
ATCTCAGAAG 
TGTATTTTAT 
GTAAAGCTGC 
AAAAAGAACA 



AGTAAGATGT 
TTCAAAAGGC 
TAGGAAACAC 
AGAAGTATTT 
ATAAATAAAT 
AAGAAATCTA 
TTTCAAACAT 
TCAAAAAATG 
TCGGTGAATT 
TCAATTTTCA 
AAAGAATCTT 
TATTTTATTT 
TCTTGAAGTT 
ATAAGCTGTT 
AGATAC 



TTGAAGCTTA 
TGCTGATGTA 
ATACACTTCC 
TACCCACTGG 
GTGTTAGTGC 
GTATTATTTG 
CTGAGGCATG 
TCAAAAAATG 
TTCAATTTAA 
GCAGAAAACA 
AGCACAAACA 
GAAGTGAAGA 
GGCCAACAGA 
AGGTTTTGTT 



TGGAAATTTG 
GTTCCCGGGT 
ATAAATAGCT 
TGGTTTGTGT 
AAGTCATCTT 
TTGAAAATGG 
ATAAATTTAT 
AGCAACAGAG 
GGTATGAAAA 
TGTCAACTTT 
GGACTGTTGT 
ACTTATTTAA 
GTTGTGAATG 
TTAAAAGGAC 



AGTAACAAAC 
TACCTGTATC 
TTAACGTATG 
GTGTATGAAG 
CCCTACCCAT 
TTAGAATAAA 
TATCCATAAT 
GGACCTTATT 
TAAGTTTTTA 
AAAATATAGG 
ACTAGATGTT 
GAATTATTTC 
TGTGTGGAAG 
ATGTTTATTA 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 



20 



25 



Seq ID NO: 85 Protein sequence: 
Protein Accession #: NP 008967.1 



11 



21 



31 



41 



51 



I ! I ! I I 

MKSVLLLTTL LVPAHLVAAW SNNYAVDCPQ HCDSSBCKSS PRCKRTVLDD CGCCRVCAAG 
RGETCYRTVS GMDGMKCGPG LRCQPSNGED PPGEEFGICK DCPYGTFGMD CRETCNCQSG 
ICDRGTGKCL KFPFFQYSVT KSSNRFVSLT EHDMASGDGN IVREEWKEN AAGSPVMRKW 
LNPR 



60 
120 
180 



Seq ID NO: 86 Nucleotide sequence: 
30 Nucleic Acid Accession #: D86983 

Coding sequence: 52-4491 (underlined sequences correspond to start and stop codons) 



35 



40 



45 



50 



55 



60 



65 



70 



75 



AGCCGGCCGT 
CGCTCCAGGG 
ACGCTGGCCG 
CGCACCACCG 
ACCTCCATCC 
CGGCTGAGGA 
GGAGCATTTG 
TCAATTGACA 
AATCAGATAG 
TTTTTGCATA 
ATGAAGAGAT 
GCGGATTTGC 
TATCCCAGAC 
GAAAGGCCCC 
TACTTCACCT 
AATGAGCTGA 
ATCCAGAACA 
GGAGAGGTGA 
TTTGTAATCC 
AGCGCCACAG 
CCAGTTGACC 
CAGGGGGACA 
ACCGCTTTCA 
GTTATTGAGG 
' ATCGCCTGGA 
TCGGGAACAC 
GCTGTCAACA 
ACCCCAGTGT 
CTCCCGTGCA 
CAGGTGACAG 
GTTGGCCCTG 
TCGGTGAGCA 
GTAGCTACCT 
ACACATTTGT 
CCGAGGGATC 
CAGCTCATTC 
CACTACAACG 
ACCGCCCACC 
CACGACGGCA 
GAGCGCCTGC 
CACCGACTGT 
GGGACGGAGA 



11 
I 

GGTGGCTCCG 
GCCCCGGGCG 
TGGTGGCCCA 
TGCGCTGCAT 
TAGATCTTCG 
ACTTGAACAC 
AAGACTTGGA 
GGCAAGCATT 
AAACTTTGGA 
ACAACCGGAT 
TGCGACTGGA 
TGAAAACCTA 
GCATCCAGGG 
GGATCACCTC 
GCAGAGCCGA 
GCATGAAGAC 
CACAGGAGAC 
AGACGCAAGA 
AGCCACAGAA 
GCCACCCCCC 
CGCGGGTGAA 
GCGGAGAGTA 
TCATCGTCCA 
GCCAGACCGT 
CCAAGGGAGG 
TTAGAATCTC 
TCATCGGCTC 
TTGCCAGCAT 
GCTCCCAGGG 
AAAGTGGAAA 
CAGACGCAGG 
TGGTGCTCAG 
CCATCGTGGA 
TTGACAGCCG 
CTTACACAGT 
AGGAGCATGT 
ACCTGGTGTC 
GGCGCGTGAA 
CCTGTAACAA 
TGAAATCCGT 
ACAACGGGCA 
CCGTCACACC 



21 
I 

TGCGTCCGAG 
CCGCTGCCTG 
GAAGCCGGGC 
GCATCTGCTG 
CTTTAACAGA 
ATTGCTTCTC 
AAATTTAAAA 
TAAGGGACTT 
CCCAGATTCG 
TACACATTTA 
CTCAAACACA 
CGCGGAGTCG 
ACGCTCAGTG 
CGAGCCCCAG 
AGGCAACCCC 
AGATTCCCGC 
AGACCAGGGT 
GGTGACCCTC 
TACAGAGGTG 
GCCGCGGATC 
CATCACGCCT 
TGCGTGCTCT 
GGCTCTTCCT 
GGATTTCCAG 
GAGCCAGCTC 
TGGTGTTGCC 
CCAGAAGGTC 
TCCCAGCGAC 
CGAGCCCGAG 
ATTTCACATC 
TCGCTATGAG 
TGTGAACGTT 
AGCGATTGCG 
TCCTCGTTCT 
TGAACAGGCA 
ACAGCATGGC 
TCCACAGTAC 
CAACTGCTCG 
CCTGCAGCAC 
GTACGAGAAT 
CGCCCTTCCC 
CGACGAGCAG 



31 
I 

CGTCCGTCCG 
TTGGCGCTCG 
GCAGGGTGTC 
CTGGAGGCCG 
ATCAGAGAGA 
AATAATAATC 
TATCTCTATC 
GCCTCTCTAG 
TTCCAGCATC 
GTTCCAGGGA 
CTTCACTGCG 
GGGAACGCGC 
GCAACCATCA 
GACGCAGATG 
AAGCCTGAGA 
CTAAACTTGC 
ATCTACCAGT 
AGGTACTTCG 
CTGGTTGGGG 
TCCTGGACGA 
TCTGGCGGGC 
GCGACCAACA 
CAGTTCACTG 
TGTGAAGCCA 
TCCGTGGACC 
CTCCACGACC 
GTGGCCCACC 
ACAACAGTGG 
CCAGCCATCA 
AGCCCTGAAG 
TGTGTGGCCC 
CCTGACGTCA 
ACTGTTGACA 
CCAAATGATT 
CGGGCGGGAG 
TTGATGGTCG 
CTGAACCTCA 
GACATGTGCT 
CCCATGTGGG 
GGCTTCAACA 
ATGCCGCGCC 
TTCACCCACA 



41 

I 

CGCCGTCGGC 
TGCTGTTCTG 
CGAGCCGCTG 
TGCCCGCCGT 
TCCAACCTGG 
AGATCAAGAG 
TGTACAAGAA 
AGCAACTATA 
TCCCGAAGCT 
CATTTAATCA 
ACTGTGAAAT 
AGGCAGCGGC 
CCCCGGAAGA 
TGACCTCGGG 
TCATCTGGCT 
TGGACGATGG 
GCATGGCAAA 
GGTCTCCAGC 
AGAGCGTCAC 
GAGGTGACCG 
TTTACATACA 
ACATTGACAG 
TGACGCCTCA 
AGGGCAACCC 
GGCGGCACCT 
AGGGCCAGTA 
TGACTGTGCA 
AGGTGGGCGC 
CCTGGAACAA 
GATTCTTGAC 
GGAACACCAT 
GTCGAAATGG 
GAGCTATAAA 
TGCTGGCCTT 
AAATCTTTGA 
ACCTCAACGG 
TCGCAAACCT 
TCCACCAGAA 
GCGCCTCGCT 
CCCCTCGGGG 
TGGTGTCCAC 
TGCTGATGCA 



51 
I 

CATGGCCAAG 
CGCCTGGGGG 
CCTGTGCTTC 
GGCGCCGCAG 
GGCATTCAGG 
GATACCTAGT 
TGAGATCCAG 
CCTGCACTTT 
CGAGAGGCTA 
CTTGGAATCT 
CCTGTGGTTG 
CATCTGTGAA 
GCTGAACTGT 
GAACACCGTG 
GCGAAACAAT 
GACCCTGATG 
GAACGTGGCC 
TCGACCCACT 
GCTGGAGTGC 
CACACCCTTG 
GAACGTCGTA 
CGTCCATGCC 
GGACAGAGTC 
GCCGCCCGTC 
GGTCCTGTCA 
CGAATGCCAG 
GCCCAGAGTC 
CAATGTGCAG 
GGATGGGGTT 
CATCAATGAC 
TGGGTCGGCC 
AGATCCGTTT 
CTCAACCCGA 
GTTCCGGTAT 
ACGGACATTG 
AACAAGTTAC 
GTCGGGCTGT 
GTACCGGACG 
GACCGCCTTC 
CATCAACCCC 
CACCCTGATC 
GTGGGGCCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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TTCCTGGACC ACGACCTCGA CTCCACGGTG GTGGCCCTGA GCCAGGCACG CTTCTCCGAC 2580 

GGACAGCACT GCAGCAACGT GTGCAGCAAC GACCCCCCCT GCTTCTCTGT CATGATCCCC 2640 

CCCAATGACT CCCGGGCCAG GAGOGGGGCC CGCTGCATGT TCTTCGTGCG CTCCAGCCCT 2700 

GTGTGCGGCA GCGGCATGAC TTCGCTGCTC ATGAACTCCG TGTACCCGCG GGAGCAGATC 2760 

AACCAGCTCA CCTCCTACAT CGACGCATCC AACGTGTACG GGAGCACGGA GCATGAGGCC 2820 

CGCAGCATCC GCGACCTGGC CAGCCACCGC GGCCTGCTGC GGCAGGGCAT CGTGCAGCGG 2880 

TCCGGGAAGC CGCTGCTCCC CTTCGCCACC GGGCCGCCCA CGGAGTGCAT GCGGGACGAG 2940 

AACGAGAGCC CCATCCCCTG CTTCCTGGCC GGGGACCACC GCGCCAACGA GCAGCTGGGC 3000 

CTGACCAGCA TGCACACGCT GTGGTTCCGC GAGCACAACC GCATTGCCAC GGAGCTGCTC 3060 

AAGCTGAACC CGCACTGGGA CGGCGACACC ATCTACTATG AGACCAGGAA GATCGTGGGT 3120 

GCGGAGATCC AGCACATCAC CTACCAGCAC TGGCTCCCGA AGATCCTGGG GGAGGTGGGC 31 BO 

ATGAGGACGC TGGGAGAGTA CCACGGCTAC GACCCCGGCA TCAATGCTGG CATCTTCAAC 3240 

GCCTTCGCCA CCGCGGCCTT CAGGTTTGGC CACACGCTTG TCAACCCACT GCTTTACCGG 3300 

CTGGACGAGA ACTTCCAGCC CATTGCACAA GATCACCTCC CCCTTCACAA AGCTTTCTTC 3360 

TCTCCCTTCC GGATTGTGAA TGAGGGCGGC ATCGATCCGC TTCTCAGGGG GCTGTTCGGG 3420 

GTGGCGGGGA AAATGCGTGT GCCCTCGCAG CTGCTGAACA CGGAGCTCAC GGAGCGGCTG 3480 

TTCTCCATGG CACACACGGT GGCTCTGGAC CTGGCGGCCA TCAACATCCA GCGGGGCCGG 3540 

GACCACGGGA TCCCACCCTA CCACGACTAC AGGGTCTACT GCAATCTATC GGCGGCACAC 3600 

ACGTTCGAGG ACCTGAAAAA TGAGATTAAA AACCCTGAGA TCCGGGAGAA ACTGAAAAGG 3660 

TTGTATGGCT CGACACTCAA CATCGACCTG TTTCCGGCGC TCGTGGTGGA GGACCTGGTG 3720 

CCTGGCAGCC GGCTGGGCCC CACCCTGATG TGTCTTCTCA GCACACAGTT CAAGCGCCTG 3780 

CGAGATGGGG ACAGGTTGTG GTATGAGAAC CCTGGGGTGT TCTCCCCGGC CCAGCTGACT 3840 

CAGATCAAGC AGACGTCGCT GGCCAGGATC CTATGCGACA ACGCGGACAA CATCACCCGG 3900 

GTGCAGAGCG ACGTGTTCAG GGTGGCGGAG TTCCCTCACG GCTACGGCAG CTGTGACGAG 3960 

ATCCCCAGGG TGGACCTCCG GGTGTGGCAG GACTGCTGTG AAGACTGTAG GACCAGGGGG 4020 

CAGTTCAATG CCTTTTCCTA TCATTTCCGA GGCAGACGGT CTCTTGAGTT CAGCTACCAG 4080 

GAGGACAAGC CGACCAAGAA AACAAGACCA CGGAAAATAC CCAGTGTTGG GAGACAGGGG 4140 

GAACATCTCA GCAACAGCAC CTCAGCCTTC AGCACACGCT CAGATGCATC TGGGACAAAT 4200 

GACTTCAGAG AGTTTGTTCT GGAAATGCAG AAGACCATCA CAGACCTCAG AACACAGATA 4260 

AAGAAACTTG AATCACGGCT CAGTACCACA GAGTGCGTGG ATGCCGGGGG CGAATCTCAC 4320 

GCCAACAACA CCAAGTGGAA AAAAGATGCA TGCACCATTT GTGAATGCAA AGACGGGCAG 4380 

GTCACCTGCT TCGTGGAAGC TTGCCCCCCT GCCACCTGTG CTGTCCCCGT GAACATCCCA 4440 

GGGGCCTGCT GTCCAGTCTG CTTACAGAAG AGGGCGGAGG AAAAGCCCTAJ3GCTCCTGGG 4500 

AGGCTCCTCA GAGTTTGTCT GCTGTGCCAT CGTGAGATCG GGTGGCCGAT GGCAGGGAGC 4560 

TGCGGACTGC AGACCAGGAA ACACCCAGAA CTCGTGACAT TTCATGACAA CGTCCAGCTG 4620 

GTGCTGTTAC AGAAGGCAGT GCAGGAGGCT TCCAACCAGA GCATCTGCGG AGAAGGAGGC 4680 

ACAGCAGGTG CCTGAAGGGA AGCAGGCAGG AGTCCTAGCT TCACGTTAGA CTTCTCAGGT 4740 

TTTTATTTAA TTCTTTTAAA ATGAAAAATT GGTGCTACTA TTAAATTGCA CAGTTGAATC 4800 

ATTTAGGCGC CTAAATTGGT TTTGCCTCCC AACACCATTT CTTTTTAAAT AAAGCAGGAT 4860 

ACCTCTATAT GTCAGCCTTG CCTTGTTCAG ATGCCAGGAG CCGGCAGACC TGTCACCCGC 4920 

AGGTGGGGTG AGTCTCGGAG CTGCCAGAGG GGCTCACCGA AATCGGGGTT CCATCACAAG 4980 

CTATGTTTAA AAAGAAAATT GGTGTTTGGC AAACGGAACA GAACCTTTGA TGAGAGCGTT 5040 

CACAGGGACA CTGTCTGGGG GTGCAGTGCA AGCCCCCGGC CTCTTCCCTG GGAACCTCTG 5100 

AACTCCTCCT TCCTCTGGGC TCTCTGTAAC ATTTCACCAC ACGTCAGCAT CTAATCCCAA 5160 

GACAAACATT CCCGCTGCTC GAAGCAGCTG TATAGCCTGT GACTCTCCGT GTGTCAGCTC 5220 

CTTCCACACC TGATTAGAAC ATTCATAAGC CACATTTAGA AACAGATTTG CTTTCAGCTG 5280 

TCACTTGCAC ACATACTGCC TAGTTGTGAA CCAAATGTGA AAAAACCTCC TTCATCCCAT 5340 

TGTGTATCTG ATACCTGCCG AGGGCCAAGG GTGTGTGTTG ACAACGCCGC TCCCAGCCGG 5400 

CCCTGGTTGC GTCCACGTCC TGAACAAGAG CCGCTTCCGG ATGGCTCTTC CCAAGGGAGG 5460 
AGGAGCTCAA GTGTCGGGAA CTGTCTAACT TCAGGTTGTG TGAGTGCGTT 

Seq ID NO: 87 Protein sequence: 
Protein Accession #: BAA13219 

1 11 21 31 41 51 

I I I I I I 

SRPWWLRASE RPSAPSAMAK RSRGPGRRCL LALVLPCAWG TLAWAQKPG AGCPSRCLCF 60 

RTTVRCMHLL LEAVPAVAPQ TSILDLRFNR IREIQPGAFR RLRNLNTLLL NNNQIKRIPS 120 

GAFEDLENLK YLYLYKNEIQ SIDRQAFKGL ASLEQLYLHF NQIETLDPDS FQHLPKLERL 180 

FLHNNRITHL VPGTFNHLES MKRLRLDSNT LHCDCEILWI, ADLLKTYAES GNAQAAAICE 240 

YPRRIQGRSV ATITPEELNC ERPRITSBPQ DADVTSGNTV YFTCRAEGNP KPEIIWLRNN 300 

NELSMKTDSR LNLLDDGTLM IQNTQETDQG IYQCMAKNVA GEVKTQEVTL RYFGSPARPT 360 

FVIQPQNTEV LVGESVTIjEC SATGHPPPRI SWTRGDRTPL PVDPRVNITP SGGLYIQNW 420 

QGDSGEYACS ATNNIDSVHA TAFIIVQALP QFTVTPQDRV VIEGQTVDFQ CEAKGNPPPV 480 

IAWTKGGSQL SVDRRHLVLS SGTLRISGVA IiHDQGQYECQ AVNIIGSQKV VAHLTVQPRV 540 

TPVFASIPSD TTVEVGANVQ LPCSSQGEPE PAITWNKDGV QVTESGKFHI SPEGFLTIND 600 

VGPADAGRYE CVARNTIGSA SVSMVLSVNV PDVSRNGDPF VATSIVEAIA TVDRAINSTR 660 

THLFDSRPRS PNDLLAIiFRY PRDPYTVEQA RAGEIFERTL QLIQEHVQHG LMVDLNGTSY 720 

HYNDLVSPQY LNLIANLSGC TAHRRVNNCS DMCFHQKYRT HDGTCNNLQH PMWGASLTAF 780 

ERLLKSVYEN GFNTPRGINP HRLYNGHALP MPRLVSTTLI GTETVTPDEQ FTHMLMQWGQ 840 

FLDHDLDSTV VALSQARFSD GQHCSNVCSN DPPCFSVMIP PNDSRARSGA RCMFFVRSSP 900 

VCGSGMTSUb MNSVYPREQI NQLTSYIDAS NVYGSTEHEA RSIRDIiASHR GLLRQGIVQR 960 

SGKPLIiPFAT GPPTECMRDE NESPIPCFLA GDHRANEQLG LTSMHTIiWFR EHNRIATELIi 1020 

KLNPHWDGDT IYYETRKIVG AEIQHITYQH WLPKILGEVG MRTLGEYHGY DPG1NAGIFN 1080 

AFATAAFRFG HTLVNPLLYR LDENFQPIAQ DHLPIiHKAFF SPFRIVNEGG IDPLLRGLFG 1140 

VAGKMRVPSQ LLNTELTERL FSMAHTVALD LAAINIQRGR DHGIPPYHDY RVYCNLSAAH 1200 

TFEDLKNEIK NPEIREKLKR LYGSTLNIDL FPAIiWEDLV PGSRLGPTLM CLLSTQFKRL 1260 
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RDGDRLWYEN PGVFSPAQLT QIKQTSLARI LCDNADNITR VQSDVFRVAB FPHGYGSCDB 1320 

IPRVDLRVWQ DCCEDCRTRG QFNAFSYHFR GRRSLEFSYQ EDKPTKKTRP RKIPSVGRQG 1380 

EHLSNSTSAF STRSDASGTN DFREFVLEMQ KTITDLRTQI KKLESRLSTT ECVDAGGESH 1440 
ANNTKWKKDA CTICECKDGQ VTCFVEACPP ATCAVPVNIP GACCPVCLQK RAEEKP 



Seq ID NO: B8 DNA sequence 

Nucleic Acid Accession #: NM^004S34.1 

Coding sequence: 80-3577 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

AATTCGAGGA TCCGGGTACC ATGGCACAGA GCGACAGAGA CATTTATTGT TATTTGTTTT 60 

TTGGTGGCAA AAAGGGAA AA TG GCQAACGA CTCCCCTGCA AAAAGTCTGG TGGACATCGA 120 

CCTCTCCTCC CTGCGGGATC CTGCTGGGAT TTTTGAGCTG GTGGAAGTGG TTGGAAATGG 180 

CACCTATGGA CAAGTCTATA AGGGTCGACA TGTTAAAACG GGTCAGTTGG CAGCCATCAA 240 

AGTTATGGAT GTCACTGAGG ATGAAGAGGA AGAAATCAAA CTGGAGATAA ATATGCTAAA 300 

GAAATACTCT CATCACAGAA ACATTGCAAC ATATTATGGT GCTTTCATCA AAAAGAGCCC 360 

TCCAGGACAT GATGACCAAC TCTGGCTTGT TATGGAGTTC TGTGGGGCTG GGTCCATTAC 420 

AGACCTTGTG AAGAACACCA AAGGGAACAC ACTCAAAGAA GACTGGATCG CTTACATCTC 480 

CAGAGAAATC CTGAGGGGAC TGGCACATCT TCACATTCAT CATGTGATTC ACCGGGATAT 540 

CAAGGGCCAG AATGTGTTGC TGACTGAGAA TGCAGAGGTG AAACTTGTTG ACTTTGGTGT 600 

GAGTGCTCAG CTGGACAGGA CTGTGGGGCG GAGAAATACG TTCATAGGCA CTCCCTACTG 660 

GATGGCTCCT GAGGTCATCG CCTGTGATGA GAACCCAGAT GCCACCTATG ATTACAGAAG 720 

TGATCTTTGG TCTTGTGGCA TTACAGCCAT TGAGATGGCA GAAGGTGCTC CCCCTCTCTG 780 

TGACATGCAT CCAATGAGAG CACTGTTTCT CATTCCCAGA AACCCTCCTC CCCGGCTGAA 840 

GTCAAAAAAA TGGTCGAAGA AGTTTTTTAG TTTTATAGAA GGGTGCCTGG TGAAGAATTA 900 

CATGCAGCGG CCCTCTACAG AGCAGCTTTT GAAACATCCT TTTATAAGGG ATCAGCCAAA 960 

TGAAAGGCAA GTTAGAATCC AGCTTAAGGA TCATATAGAT CGTACCAGGA AGAAGAGAGG 1020 

CGAGAAAGAT GAAACTGAGT ATGAGTACAG TGGGAGTGAG GAAGAAGAGG AGGAAGTGCC 1080 

TGAACAGGAA GGAGAGCCAA GTTCCATTGT GAACGTGCCT GGTGAGTCTA CTCTTCGCCG 1140 

AGATTTCCTG AGACTGCAGC AGGAGAACAA GGAACGTTCC GAGGCTCTTC GGAGACAACA 1200 

GTTACTACAG GAGCAACAGC TCCGGGAGCA GGAAGAATAT AAAAGGCAAC TGCTGGCAGA 1260 

GAGACAGAAG CGGATTGAGC AGCAGAAAGA ACAGAGGCGA CGGCTAGAAG AGCAACAAAG 1320 

GAGAGAGCGG GAGGCTAGAA GGCAGCAGGA ACGTGAACAG CGAAGGAGAG AACAAGAAGA 1380 

AAAGAGGCGT CTAGAGGAGT TGGAGAGAAG GCGCAAAGAA GAAGAGGAGA GGAGACGGGC 1440 

AGAAGAAGAA AAGAGGAGAG TTGAAAGAGA ACAGGAGTAT ATCAGGCGAC AGCTAGAAGA 1500 

GGAGCAGCGG CACTTGGAAG TCCTTCAGCA GCAGCTGCTC CAGGAGCAGG CCATGTTACT 1560 

GCATGACCAT AGGAGGCCGC ACCCGCAGCA CTCGCAGCAG CCGCCACCAC CGCAGCAGGA 1620 

AAGGAGCAAG CCAAGCTTCC ATGCTCCCGA GCCCAAAGCC CACTACGAGC CTGCTGACCG 1680 

AGCGCGAGAG GTTCCTGTGA GAACAACATC TCGCTCCCCT GTTCTGTCCC GTCGAGATTC 1740 

CCCACTGCAG GGCAGTGGGC AGCAGAATAG CCAGGCAGGA CAGAGAAACT CCACCAGTAT 1800 

TGAGCCCAGG CTTCTGTGGG AGAGAGTGGA GAAGCTGGTG CCCAGACCTG GCAGTGGCAG 1860 

CTCCTCAGGG TCCAGCAACT CAGGATCCCA GCCCGGGTCT CACCCTGGGT CTCAGAGTGG 1920 

CTCCGGGGAA CGCTTCAGAG TGAGATCATC ATCCAAGTCT GAAGGCTCTC CATCTCAGCG 1980 

CCTGGAAAAT GCAGTGAAAA AACCTGAAGA TAAAAAGGAA GTTTTCAGAC CCCTCAAGCC 2040 

TGCTGGCGAA GTGGATCTGA CCGCACTGGC CAAAGAGCTT CGAGCAGTGG AAGATGTACG 2100 

GCCACCTCAC AAAGTAACGG ACTACTCCTC ATCCAGTGAG GAGTCGGGGA CGACGGATGA 2160 

GGAGGACGAC GATGTGGAGC AGGAAGGGGC TGACGAGTCC ACCTCAGGAC CAGAGGACAC 2220 

CAGAGCAGCG TCATCTCTGA ATTTGAGCAA TGGTGAAACG GAATCTGTGA AAACCATGAT 2280 

TGTCCATGAT GATGTAGAAA GTGAGCCGGC CATGACCCCA TCCAAGGAGG GCACTCTAAT 2340 

CGTCCGCCAG ACTCAGTCCG CTAGTAGCAC ACTCCAGAAA CACAAATCTT CCTCCTCCTT 2400 

TACACCTTTT ATAGACCCCA GATTACTACA GATTTCTCCA TCTAGCGGAA CAACAGTGAC 2460 

ATCTGTGGTG GGATTTTCCT GTGATGGGAT GAGACCAGAA GCCATAAGGC AAGATCCTAC 2520 

CCGGAAAGGC TCAGTGGTCA ATGTGAATCC TACCAACACT AGGCCACAGA GTGACACCCC 2580 

GGAGATTCGT AAATACAAGA AGAGGTTTAA CTCTGAGATT CTGTGTGCTG CCTTATGGGG 2640 

AGTGAATTTG CTAGTGGGTA CAGAGAGTGG CCTGATGCTG CTGGACAGAA GTGGCCAAGG 2700 

GAAGGTCTAT CCTCTTATCA ACCGAAGACG ATTTCAACAA ATGGACGTAC TTGAGGGCTT 2760 

GAATGTCTTG GTGACAATAT CTGGCAAAAA GGATAAGTTA CGTGTCTACT ATTTGTCCTG 2820 

GTTAAGAAAT AAAATACTTC ACAATGATCC AGAAGTTGAG AAGAAGCAGG GATGGACAAC 2880 

CGTAGGGGAT TTGGAAGGAT GTGTAGATTA TAAAGTTGTA AAATATGAAA GAATCAAATT 294 0 

TCTGGTGATT GCTTTGAAGA GTTCTGTGGA AGTCTATGCG TGGGCACCAA AGCCATATCA 3000 

CAAATTTATG GCCTTTAAGT CATTTGGAGA ATTGGTACAT AAGCCATTAC TGGTGGATCT 3060 

CACTGTTGAG GAAGGCCAGA GGTTGAAAGT GATCTATGGA TCCTGTGCTG GATTCCATGC 3120 

TGTTGATGTG GATTCAGGAT CAGTCTATGA CATTTATCTA CCAACACATG TAAGAAAGAA 3180 

CCCACACTCT ATGATCCAGT GTAGCATCAA ACCCCATGCA ATCATCATCC TCCCCAATAC 3240 

AGATGGAATG GAGCTTCTGG TGTGCTATGA AGATGAGGGG GTTTATGTAA ACACATATGG 3300 

AAGGATCACC AAGGATGTAG TTCTACAGTG GGGAGAGATG CCTACATCAG TAG CAT AT AT 3360 

TCGATCCAAT CAGACAATGG GCTGGGGAGA GAAGGCCATA GAGATCCGAT CTGTGGAAAC 3420 

TGGTCACTTG GATGGTGTGT TCATGCACAA AAGGGCTCAA AGACTAAAAT TCTTGTGTGA 3480 

ACGCAATGAC AAGGTGTTCT TTGCCTCTGT TCGGTCTGGT GGCAGCAGTC AGGTTTATTT 3540 

CATGACCTTA GGCAGGACTT CTCTTCTGAG CTGGTAGAAG CAGTGTGATC CAGGGATTAC 3600 

TGGCCTCCAG AGTCTTCAAG ATCCTGAGAA CTTGGAATTC CTTGTAACTG GAGCTCGGAG 3660 

CTGCACCGAG GGCAACCAGG ACAGCTGTGT GTGCAGACCT CATGTGTTGG GTTCTCTCCC 3720 

CTCCTTCCTG TTCCTCTTAT ATACCAGTTT ATCCCCATTC TTTTTTTTTT TCTTACTCCA 3780 
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AAATAAATCA AGGCTGCAAT GCAGCTGGTG CTGTTCAGAT TCCAAAAAAA AAAAAAAACC 3840 
ATGGTACCCG GATCCTCGAA TTCC 



Seq ID No: 89 Protein sequence; 
Protein Accession #: NP_004825.1 



1 11 21 31 41 51 

I I I I I I 

KANDSPAKSL VDIDLSSLRD PAGIFELVEV VGNGTYGQVY KGRHVKTGQL AAIKVMDVTB 60 

DEEEEIKLEI NMLKKYSHHR NIATYYGAFI KKSPPGHDDQ LWLVMEFCGA GSITDLVKNT 120 

KGNTLKEDWI AYISREILRG LAHLHIHHVI HRDIKGQNVL LTENAEVKLV DFGVSAQLDR 180 

TVGRRNTFIG TPYWMAPEVI ACDENPDATY DYRSDLWSCG ITAIEMAEGA PPLCDMHPMR 240 

ALFLIPRNPP PRLKSKKWSK KFFSFIEGCL VKNYMQRPST EQLLKHPFIR DQPNERQVRI 300 

QLKDHIDRTR KKRGEKDETE YEYSGSEEEE EEVPEQEGEP SSIVNVPGES TLRRDFLRLQ 360 

QENKERSEAL RRQQLLQEQQ LREQEEYKRQ LLAERQKRIE QQKEQRRRLB EQQRREREAR 420 

RQQEREQRRR EQEEKRRLEE LERRRKEEEE RRRAEEEKRR VEREQEYIRR QLEEEQRHLE 480 

VLQQQLLQEQ AMLLHDHRRP HPQHSQQPPP PQQERSKPSF HAPEPKAHYE PADRAREVPV 540 

RTTSRSPVLS RRDSPLQGSG QQNSQAGQRN STSIEPRLIiW ERVEKLVPRP GSGSSSGSSN 600 

SGSQPGSHPG SQSGSGERFR VRSSSKSEGS PSORLENAVK KPEDKKEVFR PLKPAGEVDL 660 

TALAKELRAV EDVRPPHKVT DYSSSSEESG TTDEEDDDVE QEGADESTSG PEDTRAASSL 720 

NLSNGETESV KTMIVHDDVE SEPAMTPSKE GTLIVRQTQS ASSTLQKHKS SSSFTPFIDP 780 

RLLQISPSSG TTVTSWGFS CDGMRPEAIR QDPTRKGSW NVNPTNTRPQ SDTPEIRKYK 840 

KRFNSEILCA ALWGVNLLVG TESGLMLLDR SGQGKVYPLI NRRRFQQMDV LEGLNVLVTI 900 

SGKKDKLRVY YLSWLRNKIL HNDPEVEKKQ GWTTVGDLEG CVHYKWKYE RIKFLVIALK 960 

SSVEVYAWAP KPYHKFMAFK SFGELVHKPL LVDLTVEEGQ RLKVIYGSCA GFHAVDVDSG 1020 

SVYDIYLPTH VRKNPHSM1Q CSIKPHAIII LPNTDGMEIjL VCYEDEGVYV NTYGRITKDV 1080 

VLQWGEMPTS VAYIRSNQTM GWGEKAIEIR SVETGHLDGV FMHKRAQRLK FLCERNDKVF 1140 
FASVRSGGSS QVYFMTLGRT SLLSW 



Seq ID NO: 90 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 2-71 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

TTACACTTCA ATTCCTTACA CGGTATTTCA AACAAACAGT TTTGCTGAGA GGAGCTTTTG 60 

TCTCTCCT TA AG AAAATGTT TATAAAGCTG AAAGGAAATC AAACAGTAAT CTTAAAAATG 120 

AAAACAAAAC AACCCAACAA CCTAGATAAC TACAGTGATC AGGGAGCACA GTTCAACTCC 180 

TTGTTATGTT TTAGTCATAT GGCCTACTCA AACAGCTAAA TAACAACACC AGTGGCAGAT 240 

AAAAATCACC ATTTATCTTT CAGCTATTAA TCTTTTGAAT GAATAAACTG TGACAAACAA 300 

ATTAACATTT TTGAACATGA AAGGCAACTT CTGCACAATC CTGTATCCAA GCAAACTTTA 360 

AATTATCCAC TTAATTATTA CTTAATCTTA AAAAAAATTA GAACCCAGAA CTTTTCAATG 420 

AAGCATTTGA AAGTTGAAGT GGAATTTAGG AAAGCCATAA AAATATAAAT ACTGTTATCA 480 

CAGCACCAGC AAGCCATAAT CTTTATACCT ATCAGTTCTA TTTCTATTAA CAGTAAAAAC 540 

ATTAAGCAAG ATATAAGACT ACCTGCCCAA GAATTCAGTC TTTTTTCATT TTTGTTTTTC 600 

TCAGTTCTGA GGATGTTAAT CGTCAAATTT TCTTTGGACT GCATTCCTCA CTACTTTTTG 660 

CACAATGGTC TCACGTTCTC ACATTTGTTC TCGCGAATAA ATTGATAAAA GGTGTTAAGT 720 

TCTGTGAATG TCTTTTTAAT TATGGGCATA ATTGTGCTTG ACTGGATAAA AACTTAAGTC 780 

CACCCTTATG TTTATAATAA TTTCTTGAGA ACAGCAAACT GCATTTACCA TCGTAAAACA 840 

ACATCTGACT TACGGGAGCT GCAGGGAAGT GGTGAGACAG TTCGAACGGC TCCTCAGAAA 900 

TCCAGTGACC CAATTCTAAA GACCATAGCA CCTGCAAGTG ACACAACAAG CAGATTTATT 960 

ATACATTTAT TAGCCTTAGC AGGCAATAAA CCAAGAATCA CTTTGAAGAC ACAGCAAAAA 1020 

GTGATACACT CCGCAGATCT GAAATAGATG TGTTCTCAGA CAACAAAGTC CCTTCAGAAT 1080 
CTTCATGTTG CATAAATGTT ATGAATATTA ATAAAAAGTT GATTGAGA 



Seq ID No: 91 Protein sequence; 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

YTSIPYTVFQ TNSFAERSFC LSL 



Seq ID NO: 92. DNA sequence 

Nucleic Acid Accession #: NM_003706.1 

Coding sequence: 310-1935 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I 1 I I I I 

CACGAGGCAG GGGCCATTTT ACCTCCAGGT TGGCCCTGCT CAGGACCAGG AGGAAACACC 60 
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TCCAGCCCGC GACCTCCTCC CACAGGGGGA AAAGGAAAGC AGGAGGACCA CAGAAGCTTT 120 

GGCACCGAGG ATCCCCGCAG TCTTCACCCG CGGAGATTCC GGCTGAAGGA GCTGTCCAGC 180 

GACTACACCG CTAAGCGCAG GGAGCCCAAG CCTCCGCACC GGATTCCGGA GCACAAGCTC 240 

CACCGCGCAT GCGCACACGC CCCAGACCCA GGCTCAGGAG GACTGAGAAT TTTCTGACCG 300 

CAGTGCAC CA TGG GAAGCTC TGAAGTTTCC ATAATTCCTG GGCTCCAGAA AGAAGAAAAG 360 

GCGGCCGTGG AGAGACGAAG ACTTCATGTG CTGAAAGCTC TGAAGAAGCT AAGGATTGAG 420 

GCTGATGAGG CCCCAGTTGT TGCTGTGCTG GGCTCAGGCG GAGGACTGCG GGCTCACATT 480 

GCCTGCCTTG GGGTCCTGAG TGAGATGAAA GAACAGGGCC TGTTGGATGC CGTCACGTAC 540 

CTCGCAGGGG TCTCTGGATC CACTTGGGCA ATATCTTCTC TCTACACCAA TGATGGTGAC 600 

ATGGAAGCTC TCGAGGCTGA CCTGAAACAT CGATTTACCC GACAGGAGTG GGACTTGGCT 660 

AAGAGCCTAC AGAAAACCAT CCAAGCAGCG AGGTCTGAGA ATTACTCTCT GACCGACTTC 720 

TGGGCCTACA TGGTTATCTC TAAGCAAACC AGAGAACTGC CGGAGTCTCA TTTGTCCAAT 780 

ATGAAGAAGC CCGTGGAAGA AGGGACACTA CCCTACCCAA TATTTGCAGC CATTGACAAT 840 

GACCTGCAAC CTTCCTGGCA GGAGGCAAGA GCACCAGAGA CCTGGTTCGA GTTCACCCCT 900 

CACCACGCTG GCTTCTCTGC ACTGGGGGCC TTTGTTTCCA TAACCCACTT CGGAAGCAAA 960 

TTCAAGAAGG GAAGACTGGT CAGAACTCAC CCTGAGAGAG ACCTGACTTT CCTGAGAGGT 1020 

TTATGGGGAA GTGCTCTTGG TAACACTGAA GTCATTAGGG AATACATTTT TGACCAGTTA 1080 

AGGAATCTGA CCCTGAAAGG TTTATGGAGA AGGGCTGTTG CTAATGCTAA AAGCATTGGA 1140 

CACCTTATTT TTGCCCGATT ACTGAGGCTG CAAGAAAGTT CACAAGGGGA ACATCCTCCC 1200 

CCAGAAGATG AAGGCGGTGA GCCTGAACAC ACCTGGCTGA CTGAGATGCT CGAGAATTGG 1260 

ACCAGGACCT CCCTGGAAAA GCAGGAGCAG CCCCATGAGG ACCCCGAAAG GAAAGGCTCA 1320 

CTCAGTAACT TGATGGATTT TGTGAAGAAA ACAGGCATTT GCGCTTCAAA GTGGGAATGG 1380 

GGGACCACTC ACAACTTCCT GTACAAACAC GGTGGCATCC GGGACAAGAT AATGAGCAGC 1440 

CGGAAGCACC TCCACCTGGT GGATGCTGGT TTAGCCATCA ACACTCCCTT CCCACTCGTG 1500 

CTGCCCCCGA CGCGGGAGGT TCACCTCATC CTCTCCTTCG ACTTCAGTGC CGGAGATCCT 1560 

TTCGAGACCA TCCGGGCTAC CACTGACTAC TGCCGCCGCC ACAAGATCCC CTTTCCCCAA 1620 

GTAGAAGAGG CTGAGCTGGA TTTGTGGTCC AAGGCCCCCG CCAGCTGCTA CATCCTGAAA 1680 

GGAGAAACTG GACCAGTGGT GATACATTTT CCCCTGTTCA ACATAGATGC CTGTGGAGGT 1740 

GATATTGAGG CATGGAGTGA CACATACGAC ACATTCAAGC TTGCTGACAC CTACACTCTA 1800 

GATGTGGTGG TGCTACTCTT GGCATTAGCC AAGAAGAATG TCAGGGAAAA CAAGAAGAAG 1860 

ATCCTTAGAG AGTTGATGAA CGTGGCCGGG CTCTACTACC CGAAGGATAG TGCCCGAAGT 1920 

TGCTGCTTGG CATAGATGAG CCTCAGCTTC CAGGGCACTG TGGGCCTGTT GGTCTACTAG 1980 

GGCCCTGAAG TCCACCTGGC CTTCCTGTTC TTCACTCCCT TCAGCCACAC GCTTCATGGC 2040 

CTTGAGTTCA CCTTGGCTGT CCTAACAGGG CCAATCACCA GTGACCAGCT AGACTGTGAT 2100 

TTTGATAGCG TCATTCAGAA GAAGGTGTCC AAGGAGCTGA AGGTGGTGAA ATTTGTCCTG 2160 

CAGGTCCCTC GGGAGATCCT GGAGCTGGAG CATGAGTGTC TGACAATCAG AAGCATCATG 2220 

TCCAATGTCC AGATGGCCAG AATGAATGTG ATAGTTCAGA CCAATGCCTT CCACTGCTCC 2280 

TTTATGACTG CACTTCTAGC CAGTAGCTCT GCACAAGTTA GCTCTGTAGA AGTAAGAACT 2340 

TGGGCTTAAA TCATGGGCTA TCTCTCCACA GCCAAGTGGA GCTCTGAGAA TACAACAAGT 2400 

GCTCAATAAA TGCTTGCTGA TTGACTGATG AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 



Seq ID No: 93 Protein sequence: 
Protein Accession #: NP_003697.1 



1 11 21 31 41 51 

I I I ) I I 

MGSSEVSIIP GLQKEBKAAV ERRRLHVLKA LKKLRIEADE APWAVLGSG GGLRAHIACL 60 

GVLSEMKEQG LLDAVTYLAG VSGSTWAISS LYTNDGDMEA LEADLKHRFT RQEWDLAKSL 120 

QKTIQAARSE NYSLTDFWAY MVISKQTREL PESHLSNMKK PVEEGTLPYP IFAAIDNDLQ 180 

PSWQEARAPE TWFEFTPHHA GFSALGAFVS ITHFGSKFKK GRLVRTHPER DLTFLRGLWG 240 

SALGNTEVIR EYIFDQLRNL TUCGLWRRAV ANAKSIGHLI FARLLRLQES SQGEHPPPED 300 

EGGEPEHTWL TEMLENWTRT SLEKQEQPHE DPERKGSLSN LMDFVKKTGI CASKWEWGTT 360 

HNFLYKHGGI rdkimssrkh lhlvdaglai ntpfplvlpp TREVHLILSF DFSAGDPFET 420 

IRATTDYCRR HKIPFPQVEE AELDLWSKAP ASCYILKGET GPWIHFPLF NIDACGGDIE 480 

AWSDTYDTFK LADTYTLDW VLLLALAKKN VRENKKKILR ELMNVAGLYY PKDSARSCCL .540 
A 



Seq ID NO: 94 DMA sequence 
Nucleic Acid Accession #: AK027351 

Coding sequence: 1-642 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 . 51 

I I I I I I 

AGGGAAAAAA ACTCCATTAA AAAGCCCAGC TTTCCTCCAT GTTAGATGTG ACTTGGAAAA 60 

TGAGAAAGAT TTAGCAAAAT TCCACCGTAT CTTTTGCCAG GCTAGAGACA GGGAGAGCAG 120 

AGTAAAACCC TCAGGCTGCT GAAATTTCTA GGCTGTTAGG AAGCCCCTCG AATTCTGTGA .180 

AAATGAGGGT TTCTTAACTC ACACTGAGAG CGGAAAGGGG CAGACCCTTT TCATAACTCC 240 

CTCAAGTGTG TGTTACCTTT CTTTACCAGC ATGGTAAGCA ACAGGACATA TCCCAGCCTC 300 

GGACATGTCT GTATGATCCA AGGTACCCAA AGTCAGACAG AGTAAACTCA AGCCTGGCAC * 360 

TGGCTTTCTG CCGCTTCATG TGCTTTGGAA AAAGCAGGAG AAGCAATAGC AGCAGGAGTC 420 

CCCAGCAGCT GGAGCCGCAA GAATGAACTG CAAAGAGGGA ACTGACAGCA GCTGCGGCTG 480 

CAGGGGCAAC GACGAGAAGA AGATGTTGAA GTGTGTGGTG GTGGGGGACG GTGCCGTGGG 540 
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GAAAACCTGC CTGCTGATGA GCTACGCCAA CGACGCCTTC CCAGAGGAAT ACGTGCCCAC 600 

TGTGTTTGAC CACTATGCAG TTACTGTGAC TGTGGGAGGC AAGCAACACT TGCTCGGACT 660 

GTATGACACC GCGGGACAGG AGGACTACAA CCAGCTGAGG CCACTCTCCT ACCCCAACAC 720 

GGATGTGTTT TTGATCTGCT TCTCTGTCGT AAACCCTGCC TCTTACCACA ATGTCCAGGA 780 

5 GGAATGGGTC CCCGAGCTCA AGGACTGCAT GCCTCACGTG CCTTATGTCC TCATAGGGAC 840 

CCAC3ATTGAT CTCCGTGATG ACCCAAAAAC CTTGGCCCGT TTGCTGTATA TGAAAGAGAA 900 

ACCTCTCACT TACGAGCATG GTGTGAAGCT CGCAAAAGCG ATCGGAGCAC AGTGCTACTT 960 

GGAATGTTCA GCTCTGACTC AGAAAGGTCT CAAAGCGGTT TTTGATGAAG CAATCCTCAC 1020 

CATTTTCCAC CCCAAGAAAA AGAAGAAACG CTGTTCTGAG GGTCACAGCT GCTGTTCAAT 1080 

10 TATCTGAGGT TGTCTGGGAC CTGCCTCCAC CCCATCCAGG GATGAGAATG GCAGCCAATC 1140 

TCTGTGGCCA AGCTCCAGCC AAAAAGGAGG GCACGACCAG AAAGGAACTC CCTTTGCACG 1200 

GAGGCTTGCC CCATCACCCT CTGAGCCCTC CCAACACAGC ACACTAGTCA GCCCACTGCC 1260 

ACGACCTCCC TGCCAGCCAG AAGCATCCGT ACTGCACGCT GTCTGAGAAT GCTGGGCCTG 1320 

GATTGCAGAC AGTGCCGCTG CTGATCGCAT CAAAAACAAA GTCAAAGGCC ATCTCACATT 1380 

15 TTACAAATCC CCAGCTCATG AACGTGAAGC TGATAGGAAA TCACCCCAGG GAACCCGAAA 1440 

AAGAAACTTG ATTCCTCTAT TGCTGGCCTT ACTTGATGTC TTTTATAAAA CTTGGGACTA 1500 

CAATACTAAC CTTTTTTTCT GAATCTGCTG TTCTACCCAT GTGTCTCACA TTCATTTGTA 1560 

TTATTTCAAG AAATGTACTA ATTTCCAGTT CACTCAGGCC TTACTAATCC ATACCAAATT 1620 

AGCCTAAAGA CAAGGCATTT TATATTCATT TCTATTTTCA GCATGTTTCT ACCAAAGCTA 1680 

20 TTAGAACCAA CACGTACCTC TGAATGCCCG ATTATAAGAA GACATGAGAA GACTTTAAAA 1740 

GTTTTGGAAA TTTACAGAGC CATGATTTTT GAACCTAATT GAAAGAAAAC CATCTGAATT 1800 

GTTGCAGGTC CACATTTTTG CCAAAGATAC ACT CT AT AG A TGCTTAGTAG TGGCCTGATT 1860 

TTTTTCCATG TATTGCCACG ACAAACTAAA AATGAACTGT GTTTAAGAAT GTAGTATTTC 1920 

TGTTTTTCAT CCAAGTTGAT TGGGGGAAGA ATATGGCAGG ATCCATCTTT TACAGTATTT 1980 

25 TGTATTCAGT AAAGTGGACA TTCCTGCTCC TCCCTTCCCC CATTGCATGC CCTCTTCCTC 2040 

• • CCTTGATTTC ACTTTCTCTC ATGCCCGGAT CCTTTTATTC TCCCCAGTTA TAACCCAGTT 2100 

ATAAAAGAAA GATCTGAGCA TAAAGATACG TGTTTAAAAA TAACTAAAAG TAAAGGAAAG 2160 

TGCCTTAATT TTTCTATTTG CTTCAACTGA AAGTGCTTCT CAGCTCGCCC CATGTAAGTT 2220 

CTCATTCCAT GTAAATGACA TTTTCCAGTT ACAACTGGTA CTGAGATTTT GCCTCTCTCT 2280 

30 TTCCTTACTC ATCCTCCCAA ATGTCTTTGT GGGAGCCATA TCAGTGGATA CCAAGCTCTG 2340 

TATCCATTTG TCCCCTGCCC TCCACAATGT GTGACATAGA ACAGGGACTT TGGCCCTGGG 2400 

AAAGCAAAAG CTCCCAGTAA GGAATCCTGT GCCCAATGAT GTAAAACAAT TCCAAACATC 2460 

CAGGAATTTT TGTATCATAG AGCGAATTAC TTCCTATCTT TTCATTAGAG GCTATGAGGA 2520 

CTTCTAATTA GTCTTAGTTG CTTATAAGTG CCCTGGAATC ACCCAGGTAG GCACTTAATT 2580 

35 TTTTTTTCAG TTGCATGAGC AAAGTGCTTC TTAGTAGTGT GAAATTACAA CAACTTTAAG 2640 

ACTTTCCAGA TTCAAGCTCC CACTGTTGGA AAAAGCCAGC CTTTCTAATC TCTTCTGCTA 2700 

CTGGAATAAG CACTTAAGAA TTGCGTGATA GCCAGGCACC GTGGCTCATG CCTGTAATCC 2760 

CAACACTTAG GGAGGCTGAG GTGGGTGGGC CGCTTGAGCT CAGGAGTTCA AGACCAGCCT 2820 

GGGTAATATA GTGAGATCCT GTGTCTCTAT- AAAAAAATTA AAAATTAGTC AGTTGTAGTG 2880 

40 ACACATACCT GTAGTCCCAG CTACTCAGGA GGCTGAGGTG GAAGGATCAC TTGAGCCCAG 2940 

AAGGTAAGGC TGCAGTGAGC TGTGACTGTG CCACTACACT CCAGCCTGAG TGACAGAGAA 3000 

AGAACCTGTC AAAAAAAAAA AAAAAACAAC CTACATTTCA AGTACTATTT CCCTTCTCTC 3060 

CCATCTAATT GCTAAAGATT TTCTTTCATA CGCACACACT CCAGTGACTG GAAAAACGGG 3120 

AGTTTTCAGT CAAAGCTTGA CATTTAGAGA AAACAAGGAC TTTCTGCCTT TATAAATGGA 3180 

45 AATCAACTGT GTATGAACTA TAACTCTGCA GAGGTTATGA ATT CATC CTT TACAAACAAT 3240 

AATGAACTTT TAGTCCTGTA ATAAATGAAA TGTTATTAGG CAGCTTTGTT GCATGATTGC 3300 

ATAGTTATAT CTTGCTAACG GGCCACTCAT TTCTCACTGA TGTGGATGAA AAAATGAGAG 3360 

CAGTATGTTT CCAGGTGTGT GCACTCAACA GGCAAATAGC TCCCGAGGTC ACCACTTCCC 3420 

TAATGGGCCA CAGGAAGTAA GTTGATCTTG ATGGGGAGAT CACGTCACCC AGAACCAGCA 3480 

50 ACTGGATAGA GACTGTTGTT AGTGTCTGGG TAGAGCACAG GCTCCCAGGG GTCTTAAGAG 3540 
CTAATTACTG AATAAAACAA TCTAGAACAA AGCAA 

Seq ID No: 95 Protein sequence: 
Protein Accession #: CAC06611.1 



1 11 21 31 41 51 

I I I I I I 

MNCKEGTDSS CGCRGNDEKK MLKCVWGDG AVGKTCLLMS YANDAFPEEY VPTVFDHYAV 60 

60 TVTVGGKQHL LGLYDTAGQE DYNQLRPLSY PNTDVFLICF SWNPASYHN VQEEWVPELK 120 

DCMPHVPYVL IGTQIDLRDD PKTLARLLYM KEKPLTYSHG VKLAKAIGAQ CYLECSALTQ 180 

KGLKAVFDEA ILTIFHPKKK KKRCSEGHSC CSII 



65 .Seq ID NO: 96 DNA sequence 

Nucleic Acid Accession #: NM_003654.1 

Coding sequence: 367-1602 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

70 | | | | | | 

GGGGAGGGCG CGGGAGGCGG AGGATGCCGC CGCGGCTGCT GCCGCCGCCG CCACCCGCGG 60 

GTCCCCGGCG ACCCTACTCC AGACCCGAGG ATGGAGCCGG CGCTGGGCGC TGCAGCTGCT 120 

CCCGGCGCGT CCCCGACCAG GTAGCTGGTG TCACTTCGGT GTGGTTGGAA GAAGACTTTC 180 

TCCCCAGCTG CATTCCCGGA GGCGCCCTTT CGACCTGGAG GCCGGGTCTG CTGGCCACAG 240 

75 GGCTGCCGCA CTGGCTGGGA CTGCCAGCTG GGCCTGGAGA CGCTGGTGGC TGTGGACTCC 300 

CCAGCTTGGA GCAGTCCCTC TTTGACCTCA CCCCTTGGAG AAGCAGCCCC ATGAAGGTGC 360 
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CCAGCCATGC AATGTTCCTG GAAGGCCGTC CTCCTCCTTG CCCTGGCCTC CATTGCCATC 420 

CAGTACACGG CCATCCGCAC CTTCACCGCC AAGTCCTTTC ACACCTGCCC CGGGCTGGCA 480 

GAGGCCGGGC TGGCCGAGCG ACTGTGCGAG GAGAGCCCCA CCTTCGCCTA CAACCTCTCC 540 

CGCAAGACCC ACATCCTCAT CCTGGCCACC ACGCGCAGCG GCTCCTCCTT CGTGGGCCAG 600 

5 CTCTTCAACC AGCACCTGGA CGTCTTCTAC CTGTTTGAGC CCCTCTACCA CGTCCAGAAC 660 

ACGCTCATCC CCCGCTTCAC CCAGGGCAAG AGCCCGGCCG ACCGGCGGGT CATGCTAGGC 720 

GCCAGCCGCG ACCTCCTGCG GAGCCTCTAC GACTGCGACC TCTACTTCCT GGAGAACTAC 780 

ATCAAGCCGC CGCCGGTCAA CCACACCACC GACAGGATCT TCCGCCGCGG GGCCAGCCGG 840 

GTCCTCTGCT CCCGGCCTGT GTGCGACCCT CCGGGGCCAG CCGACCTGGT CCTGGAGGAG 900 

10 GGGGACTGTG TGCGCAAGTG CGGGCTACTC AACCTGACCG TGGCGGCCGA GGCGTGCCGC 960 

GAGCGCAGCC ACGTGGCCAT CAAGACGGTG CGCGTGCCCG AGGTGAACGA CCTGCGCGCC 1020 

CTGGTGGAAG ACCCGCGATT AAACCTCAAG GTCATCCAGC TGGTCCGAGA CCCCCGCGGC 1080 

ATTCTGGCTT CGCGCAGCGA GACCTTCCGC GACACGTACC GGCTCTGGCG GCTCTGGTAC 1140 

GGCACCGGGA GGAAACCCTA CAACCTGGAC GTGACGCAGC TGACCACGGT GTGCGAGGAC 1200 

15 TTCTCCAACT CCGTGTCCAC CGGCCTCATG CGGCCCCCGT GGCTCAAGGG CAAGTACATG 1260 

TTGGTGCGCT ACGAGGACCT GGCTCGGAAC CCTATGAAGA AGACCGAGGA GATCTACGGG 1320 

TTCCTGGGCA TCCCGCTGGA CAGCCACGTG GCCCGCTGGA TCCAGAACAA CACGCGGGGC 1380 

GACCCCACCC TGGGCAAGCA CAAATACGGC ACCGTGCGAA ACTCGGCGGC CACGGCCGAG 1440 

AAGTGGCGCT TCCGCCTCTC CTACGACATC GTGGCCTTTG CCCAGAACGC CTGCCAGCAG 1500 

20 GTGCTGGCCC AGCTGGGCTA CAAGATCGCC GCCTCGGAGG AGGAGCTGAA GAACCCCTCG 1560 

GTCAGCCTGG TGGAGGAGCG GGACTTCCGC CCCTTCTCGT_GACCCGGGCG GTGCGGGTGG 1620 

GGGCGGGAGG CGCAAGGTGT CGGTTTTGAT AAAATGGACC GTTTTTAACT GTTGCCTTAT 1680 

TAACCCCTCC CTCTCCCACC TCATCTTCGT GTCCTTCCTG CCCCCAGCTC ACCCCACTCC 1740 

CTTCTGCCCC TTTTTTGTCT CTGAAATTTG CACTACGTCT TGGACGGGAA TCACTGGGGC 1800 

25 AGAGGGCGCC TGAAGTAGGG TCCCGCCCCC CCCACCCCAT TCAGACACAT GGATGTTGGG 1860 

TCTCTGTGCG GACGGTGACA ATGTTTACAA GCACCACATT TACACATCCA CACACGCACA 1920 

CGGGCACTCG CGAGGCGACT TCTCAAGCTT TTGAATGGGT GAGTGGTCGG GTATCTAGTT 1980 

TTTGCACTGT CTTACTATTC AAGGTAAGAG GATACAAACA AGAGGACCAC TTGTCTCTAA 2040 

TTTATGAATG GTGTCCATCC TTTCCCCATC CCTGCCTCCT GCCCCTGACG CCCATTTCCC 2100 

30 CCCTTAGAGC AGCGAAACTG CCCCCTCCTG CCCGCCCTTG CCTGTCGGTG AGGCAGGTTT 2160 

TTACTGTGAG GTGAACGTGG ACCTGTTTCT GTTTCCAGTC TGTGGTGATG CTGTCTGTCT 2220 

GTCTGAGTCT CGTGGCCGCC CCTGGACCAG TGATGACTGA TGAATCTTAT GAGCTTCTGA 2280 

TTGATCTCGG GGTCCATCTG TGATATTTCT TTGTGCCAAA AAGAAAAAAA AAGAGTGGAT 2340 

CAGTTTGCTA AATGAACATT GAAATTGAAA TGCTTTATCT GTGTTTTCTG TAAATAAAAG 2400 

35 AGTGCAATAA TCACC 

Seq ID No: 97 Protein sequence: 
Protein Accession #: NP 003645.1 



40 



45 



50 



i 
I 

MQCSWKAVLL 
THILILATTR 
RDLLRSLYDC 
CVRKCGLLNL 
ASRSETFRDT 
RYEDIARNPM 
RFRLSYDIVA 



11 
I 

LALASIAIQY 
SGSSFVGQLF 
DLYFLENYIK 
TVAAEACRER 
YRLWRLWYGT 
KKTEEIYGFL 
FAQNACQQVL 



21 
I 

TAIRTFTAKS 
NQHLDVFYLF 
PPPVNHTTDR 
SHVAIKTVRV 
GRKPYNLDVT 
GIPLDSHVAR 
AQUGYKIAAS 



31 

I 

FHTCPGLAEA 
EPLYHVQNTL 
IFRRGASRVL 
PEVNDLRALV 
QLTTVCEDFS 
WIQNNTRGDP 
EEELKNPSVS 



41 

I 

GLAERLCEES 
IPRFTQGKSP 
CSRPVCDPPG 
EDPRLNLKVI 
NSVSTGLMRP 
TLGKHKYGTV 
LVEERDFRPF 



51 
I 

PTFAYNLSRK 
ADRRVMLGAS 
PADLVLEEGD 
QLVRDPRGIL 
PWLKGKYMLV 
RNSAATAEKW 
S 



60 
120 
180 
240 
300 
360 



55 



60 



65 



70 



75 



Seq ID NO: 98 DNA sequence 

Nucleic Acid Accession #: NM_002852.1 

Coding sequence: 68-1213 (underlined sequences correspond to start and stop codons) 



CTCAAACTCA 
TCCAGCAATG 
GAACTCGGAT 
CCATCCCACT 
GCTCTTCATC 
CGACGTCCTG 
CCTGGCGAGG 
CGAGCTGCTG 
GGCGCAGCGC 
GACGCGAGCC 
TTGTGAAACA 
AGTGAGACCA 
ATTAAACAAA 
GTATCTCAGC 
TGAAGCCATG 
AGGGCTCACA 
AGGTCACATT 
TGTGGGTGGT 
CTGGGATAGT 



11 
I 

GCTCACTTGA 
CATCTCCTTG 
GATTATGATC 
GAGGACCCCA 
ATGCTGGAGA 
CGGGGCGAGC 
CCGTGCGCGC 
CAGGCGACCC 
CCAGAGGAGG 
GACCTGCACG 
GCTATTTTAT 
ATGAGGCTTG 
ACCATCCTGT 
TACCAATCCA 
GTTTCCCTGG 
TCCTTGTGGG 
GTTCCTGAGG 
GGCTTTGATG 
GTTCTTAGCA 



21 
I 

GAGTCTCCTC 
CGATTCTGTT 
TCATGTATGT 
CGCCGTGCGA 
ACTCGCAGAT 
TGCAGAGGCT 
CGGGGGCTCC 
GCGACGCGGG 
CGGGGCGCGC 
CGGTGCAGGG 
TCCCAATGCG 
AGTCTTTTAG 
TTTCCTATGG 
TAGTGTTTGT 
GAAGGTGGAC 
TAAATGGTGA 
GAGGAATCCT 
AAACATTAGC 
ATGAAGAGAT 



31 

I 

CCGCCAGCTG 
TTGTGCTCTC 
GAATTTGGAC 
CTGCGGTCAG 
GAGAGAGCGC 
GCGGGAGGAG 
CGCAGAGGCC 
CCGCAGGCTG 
CCTGGCCGCG 
CTGGGCTGCC 
TTCCAAGAAG 
TGCCTGCATT 
CACAAAGAGG 
GGTGGGTGGA 
CCACCTGTGC 
ACTGGCGGCT 
GCAGATTGGC 
CTTCTCTGGG 
AAGAGAGACC 



41 
I 

TGGAAAGAAC 
TGGTCTGCAG 
AACGAAATAG 
GAGCACTCGG 
ATGCTGCTGC 
CTGGGCCGGC 
AGGCTGACCA 
GCGCGTATGG 
GTGCTAGAGG 
CGGAGCTGGC 
ATTTTTGGAA 
TGGGTCAAAG 
AATCCATATG 
GAGGAGAACA 
GGCACCTGGA 
ACCACTGTTG 
CAAGAAAAGA 
AGACTCACAG 
GGAGGAGCAG 



51 

I 

TTTGCGTCTC 
TGTTGGCCGA 
ACAATGGACT 
AATGGGACAA 
AAGCCACGGA 
TCGCGGAAAG 
GTGCTCTGGA 
AGGGCGCGGA 
AGCTGCGGCA 
TGCCGGCAGG 
GCGTGCATCC 
CCACAGATGT 
AAATCCAGCT 
AACTGGTTGC 
ATTCAGAGGA 
AGATGGCCAC 
ATGGCTGCTG 
GCTTCAATAT 
AGTCTTGTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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10 



CATCCGGGGG 
GTATGTTTCA 
AACACATGCC 
TGAAAGAGAG 
AAGGAAAGAC 
TTTCAGTTTA 
TGTTGAACAG 
GAATTTTACA 
TATGTACCTT 
ACTATAAATG 
AAGTTATATT 
AAATAAAATA 



AATATTGTTG 
TAAATGTTGT 
AGTTGGGAAG 
AGTTGAGACC 
ATTGGAAAAA 
ATGCTGTGTC 
AGGGACAATT 
TTGGAAGAAT 
ATTACAAAAA 
TAGTTTATGT 
GCAAAAGGGA 
TTTTATAAAA 



GGTGGGGAGT 
GAAACTCCAC 
GTCTGAAAAC 
AATCTTTATT 
GCTTTTGAGG 
TCTGTCAGAT 
GTTTTACTTT 
AACAAAATAA 
AAATGATGAA 
GTTATAATCG 
TTTGTATTAA 
CTAAAAAAAA 



CACAGAGATC 
TTGAAGCCAA 
TCAGTGCATA 
TGTACTGGCC 
ATAATGTTAC 
AAACTCTCAA 
TCTTTGGTTA 
GATTTGTTGT 
AACATATTTA 
AATGTCACGT 
TTTAAGACTA 
AAAAAAA 



CAGCCACATG 
AGAAAGAAAC 
ATAGGAACAC 
AAATACTGAA 
TAGACTTTAT 
ATAATTAAAA 
ATTTTGTTTT 
CCATTGTTCA 
TACTACAAGG 
TTTTGAGAAG 
TTTTTGTAAA 



GAGGAGCTCA 
TCACACTTAA 
TTGAGACTAA 
TAAACAGTTG 
GCCATGGTGC 
AGGACTGTAT 
GGCCAGAGAT 
TTGTTATTGG 
TGACTTAACA 
ATAGTCATAT 
GCTCTACTGT 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



15 



Seq ID No: 99 Protein sequence: 
Protein Accession #: NP 002843.1 



20 



25 



MHLLAILFCA 
IMLENSQMRE 
LQATRDAGRR 
TAILFPMRSK 
SYQSIVFWG 
IVPEGGILQI 
GNIVGWGVTE 



11 
I 

LWSAVLAENS 
RMLLQATDDV 
LARMEGAEAQ 
KIFGSVHPVR 
GEENKLVAEA 
GQEKNGCCVG 
IQPHGGAQYV 



21 
I 

DDYDLMYVNL 
LRGELQRLRE 
RPEEAGRALA 
PMRLESFSAC 
MVSLGRWTHL 
GGFDETLAFS 
S 



31 

I 

DNEIDNGLHP 
ELGRLAESLA 
AVLEEIiRQTR 
IWVKATDVLN 
CGTWNSEEGL 
GRLTGFNIWD 



41 

I 

TEDPTPCDCG 
RPCAPGAPAE 
ADLHAVQGWA 
KTILFSYGTK 
TSLWVNGELA 
SVLSNEEIRE 



51 
I 

QEHSEWDKLF 
ARLTSALDEL 
ARSWLPAGCE 
RNPYEIQLYL 
ATTVEMATGH 
TGGAESCHIR 



60 
120 
180 
240 
300 
360 



Seq ID NO: 100 DNA sequence 
Nucleic Acid Accession #: NM_007351.1 
30 Coding sequence; 72-37S8 (underlined sequences correspond to start and stop codons) 



35 



40 



45 



50 



55 



60 



65 



70 



75 



CTGCTATCAA 
AAACTACTGA 
GCATTGGGCT 
AGACTATGCC 
CCACTCGGGT 
GTCTTCTTAA 
ATCAAACTCT 
TCCCAACCAA 
CTACACTGAA 
ACACAGTTGG 
GAGCCCCACG 
ACCAAAAATC 
GGTTATCTCC 
CTTGTGGCTG 
ATAGGATGCA 
GGCCGAAATG 
AAAGTCATAC 
ACCCAGAAGT 
TTCTGCAGAA 
CCTCCCTAGA 
GTCTAAAATC 
TTTTTCAAAA 
CAGAGGACCT 
TAGCAGCCCA 
TGGAACTAAG 
CTATTAAAGA 
CAAGAAGCAT 
ATGAGCAGCT 
CAGTTAGCAA 
GTTTGATGAT 
TCACCGTCTC 
CCAAATGCAG 
TAAATCAAAC 
AGCAACTAAA 
CATCACTCAG 
AGATAGAAAA 
AAAGACACAA 
TCAATGAATA 
ATGCTATTGA 
ATAATAGTGA 
CTCAGTTCCA 



11 
I 

AAAGGCCATA 
GATGAAGGGG 
TAACAACAGT 
TTCTGCTTCA 
CATGTCGGCG 
ATCAACACTG 
CACATCCACA 
CGCTAGCATC 
ATTTCTTCAG 
AGGCACTGGA 
GGAAACATAC 
AAATTTCGAA 
CACAGTGACA 
GACCGGTGGA 
ACATAAAATT 
TCAACTAAGA 
AGCTGTTGGC 
GATGCAAAAA 
GAAGATTGAC 
AGGAAAAGTC 
CAAAAGCATT 
TGACATGCAA 
CGAAAGCACC 
GCAAAAGTTT 
GAATCACATT 
ACTAGAAGTA 
TCTGTATTAT 
TTTATCAACT 
TAATGTCACT 
GCTGCAAATG 
TTTGGAGATG 
AAATGATTTT 
ATTGGCTGAA 
TGATTTGACT 
ACAGACAATG 
TCTGACTAGT 
CTTACTTAGA 
TGCCTTAGAA 
TTTCATTCAA 
GATCCATCAT 
CCGTCTGAAT 



21 
I 

AGGATTTTGT 
GCAAGATTAT 
AAGCATTCTT 
GTTCCTCCAA 
GAGATAGCTA 
CCTCCCTCAG 
GAGAAAGCAG 
AAGTTCAATC 
AGCTTTGCCA 
GGCATTGGAG 
CTCAGCCGGG 
ACAACTAGAG 
TTGGACAACC 
TCCTGTCCTC 
GTCACCTCAT 
GCCCAGGAAC 
AGAGGAGTAG 
ATGACTGATC 
AATATTTCTT 
AGCGAAGATA 
AATGTACTGA 
GAGACTGTAG 
AGGCAAATAA 
GTTTTGGTGC 
GTGAATGTAA 
AAGCAGACTC 
GAATCCCTCA 
GAACAGGTAT 
GAGTACATGT 
TTTGAAGATT 
GAGAAAGAGT 
AAATTTCAAC 
GTTCTCTTTC 
TATGATATGG 
ACATATGAAC 
GCTGTCAATA 
AATGAAGTAC 
ATGGAAGATG 
GATAACTATG 
AAATGTACCT 
GATTCTATTC 



31 
I 

CCCCAAATTT 
TTGTCCTTCT 
GGACTATACC 
ATAAAATACA 
CAACTCCAGA 
AAACAAGTGC 
AAGGAGTGGT 
CTGGAGCAGA 
GAAAGTCAAA 
GCGTTGGAGG 
GTGACAGCAG 
GAAAGAATTG 
AGGTCACTTA 
AGAGATCTCA 
TGGATTGGAG 
AGCAAAGTTT 
CTGAGCAGCA 
AGGTGAACTA 
TGACTGTGAA 
AAAGCAGAGA 
TAAGAGACAT 
CACAGCTCTT 
TTCAAAAAGT 
AAGAGAATCG 
GGCAAGAAAT 
ATTTAGAAGG 
ATAAAACTCT 
CAGACCAGAA 
CTACTTTACA 
TGCACATTCA 
CTCTCAGAGG 
TTAAGGACAC 
CAATGGACAA 
AGATCCTTCA 
AACCAAAGGA 
GTCTAAATTT 
AGGGTCGTGA 
GCCTCAATAA 
CCCTAAAAGA 
CCGATATGGA 
AGACTTTGGT 



41 

I 

CACATGAGCT 
TTCTAGTTTA 
TGAGGATGGG 
AAGTTTGCAA 
GGCAAGAACT 
ACCTGCTGAG 
CAAGTTACAG 
ATCAGTGGTC 
TGAACAAGCA 
CACTGGAGGC 
TTCCAGCCAA 
GTGTGCTTAT 
TGTCCCAGGT 
GAAGATATCC 
GTGCTGTCCT 
GATACACACC 
GCAGCAGCAA 
CCAGGCAATG 
TGATGTAAGG 
ATTTCAATCT 
AGTAAGAGAA 
CAAGACTGTA 
TAATGAATCT 
GCCCACTTTG 
GACTCTTACA 
TGCTCTAGAA 
TTCTAAATTG 
GAATGCTCCA 
TGAAAATATA 
AGAAAGCAAG 
TGAATGTGAA 
AGAAGAGAAT 
TAAGATGGAC 
ACCCTTGCTT 
AGCAATAGTG 
TATTATCAAA 
TGATGCCTTA 
GACAATGACT 
GACTTTAAGT 
AACTATTTTG 
CAATGACAAT 



51 
I 

ACCTTGCTTC 
TGGAGTGGGG 
AACTCTCAGA 
ATACTGCCAA 
TCTGAAGACA 
GGTGTGAGAA 
AATCTTACCC 
CTTTCCAATT 
ACTTCTCTAA 
GTGGGAAATC 
AGAACTGACT 
GTACATACCA 
GGGAAAGGAC 
AATCCTGTCT 
GGATACAGTG 
AACCAGGCTG 
GGCTGTGGTG 
AAACTGACTC 
AACACTTACT 
CTTCTAAAAG 
CAATTTAAAA 
TCAAGTCTAT 
GTGGTTTCAA 
ACTGATATAG 
TGTGAGAAGC 
CAGGAACACT 
AAGGAAGTAC 
GCTGCTGAGT 
AAGAAGCAGA 
ATTAACAATC 
GACATGTTAT 
TTACATGTGT 
AAAATGAGTG 
GAGCAGGGAG 
ATAAGGAAAA 
GAACTTACAA 
GAAAGACGTA 
ATTATAAATA 
ACTATTAAGG 
ACATTTATTC 
CAGAGATATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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ACTTTGTTTT GCAAGTCGCC AAGACCCTTG 
AGTCCAACTT CCAAAAGATG TATCAAATGT 
ACCAGCAAAA TATGAGTCAT TTGGAAGAAA 
ATTTTGAGAC TCGGTTGCAA GACATTGAGT 
ATATTTCAGT TAAAAAAGGC AGTGTAGTTA 
AAGTATTAAA TTCCAGATTT AAGGCGTTGG 
TCTTTTCGCT TAACAAAACT CTCCACGAAG 
GTGTGTCAGA ACTGAATGCT ACCATCCCTA 
AACTTCTTCA GAAAGGTCTA ACAGAATTTG 
CTGCCCTATC TAATTCAACT TGTTGTATAG 
TTGTCAAGTC TCAGAAGCAA GTAAAATCAT 
CAACGGTAAA TCTTACCACA GTCCTGATAG 
TATATCCTGA GGAGTATTCA AGCTGTAGTC 
TAAATGGAAG AACTAGCTTT ACCTGTGCCT 
CTATCAAGCT TGTGGAAGAA AATGCTTTAG 
ATGCACCCAT GGTGGCATTT TTTGCATCTC 
TCCTGTTTAA TAACTTGGAT GTCAATTATG 
TTAGAATTCC GTATCTTGGA GTATATGTTT 
ATATTTCTGG ATTTTTAGTG GTTGATGGAA 
TTAACAGTGA AATACACTGT GATAGGGTTT 
ATGGGCAGGA AGTCTGGTTA CGACTTGCAA 
TTACTACATT TAGTGGCTAT TTATTATATC 
CACCTTTATT GAGAAACAGC CAGTGTTTTC 
TTGGTTTTTC TACAGGAAAT GAAAATCAAC 
TCTATTTTAT AAAATTATTT GAATATTGTT 
CTAAAGAAAT TTAGTGGCAC AGAAAACAAA 
TTATTTCTTC ATTTTAAGTC ATTGCAATGG 
ATATTATCAG TCACAGTTTT CTTTCCAATT 
TAAATATATA ACACACATTT TCTAGATTCA 



CAGGTATTCC CAGAGATGAG AAACTAAATC 2520 

TCAATGAAAC CACTTCCCAA GTGAGAAAAT 2580 

AACTACTCTT AACTACCAAG ATTTCCAAAA 2640 

CTAAAGTTAC CCAGACGCTC ATACCTTATT 2700 

CAAATGAGAG AGATCAGGCT CTTCAACTGC 2760 

AAGCAAAATC TATCCATCTT TCAATTAACT 2820 

TTTTAACAAT GTGTCACAAT GCTTCTACAA 2880 

AGTGGATAAA ACATTCCCTG CCAGATATTC 2940 

TGGAACCAAT AATTCAAATA AAAACTCAAG 3000 

ATCGATCGTT GCCTGGTAGT CTGGCAAATG 3060 

TGCCAAAGAA AATTAACGCA CTTAAGAAAC 3120 

GCCGGACTCA AAGAAACACG GACAACATAA 3180 

GGCATCCGTG CCAAAATGGG GGCACGTGCA 3240 

GCAGACATCC TTTTACTGGT GACAACTGCA 3300 

CTCCAGATTT TTCCAAAGGA TCTTACAGAT 3360 

ATACGTATGG AATGACTATA CCTGGTCCTA 3420 

GAGCTTCATA TACCCCAAGA ACTGGAAAAT 3480 

TCAAGTACAC CATCGAGTCA TTTAGTGCTC 3540 

TAGACAAGCT TGCATTTGAG TCTGAAAATA 3600 

TAACTGGGGA TGCCTTATTA GAATTAAATT 3660 

AAGGAACAAT TCCAGCCAAG TTTCCCCCTG 3720 

GTACATAAGT TAGTATGAAA AACAGACTAT 3780 

ATTTATCTTT GCTTGCACAT CTGCTCTGTT 3840 

TTGTTTTTTT AATATGAGTA AACTTGTATG 3900 

TAATGTCTGA ATATGAAAGA GTTCTTGATC 3960 

GTGAATTTGT TAGCATAATT ATTCCTATTC 4020 

AAAGTAATAT TATAAAACGG TAATTACAAC 4080 

AAACACTTAA CTTTTGTTAT TCCCTGTATA 4140 
CAAATTTAAA TAAATTACTC AAAAAATG 



Seq ID No: 101 Protein sequence: 
Protein Accession #: NP 031377.1 



1 11 21 31 41 51 

I I I I I I 

MKGARLFVLL SSLWSGGIGL NNSKHSWTIP EDGNSQKTMP SASVPPNKIQ SLQILPTTRV 60 

MSAEIATTPE ARTSEDSLLK STLPPSETSA PAEGVRNQTL TSTEKAEGW KLQNLTLPTN 120 

ASIKPNPGAE SWLSNSTLK FLQSFARKSN EQATSLNTVG GTGGIGGVGG TGGVGNRAPR 180 

ETYLSRGDSS SSQRTDYQKS NPETTRGKNW CAYVHTRLSP TVTLDNQVTY VPGGKGPCGW 240 

TGGSCPQRSQ KISNPVYRMQ HKIVTSLDWR CCPGYSGPKC QLRAQEQQSL IHTNQAESHT 300 

AVGRGVAEQQ QQQGCGDPEV MQKMTDQVNY QAMKLTLLQK KIDNISLTVN DVRNTYSSLE 360 

GKVSEDKSRE PQSLLKGLKS KS1NVLIRDI VREQFKIFQN* DMQETVAQLF KTVSSLSEDL 420 

ESTRQIIQKV NESWSIAAQ QKFVLVQENR PTLTDIVELR NHIVNVRQEM TLTCEKPIKE 480 

LEVKQTHLEG ALEQEKSRSI LYYESLNKTX SKLKEVHEQL LSTEQVSDQK NAPAAESVSN 540 

NVTEYMSTLH ENIKKQSLMM LQMFEDLHIQ ESKINNljTVS LEMEKESLRG ECEDMLSKCR 600 

NDFKFQLKDT EENLHVLNQT LAEVLFPMDN KMDKMSEQLN DLTYDMEILQ PLLEOGASLR 660 

QTMTYEQPKE AIVIRKKIEN LTSAVNSLNF IIKELTKRHN LLRNEVQGRD DALERRINEY 720 

ALEMEDGLNK TMTIINNAID FIQDNYALKE TLSTIKDNSE IHHKCTSDME TILTFIPQFH 780 

RLNDSIQTLV NDNQRYNFVL QVAKTLAGIP RDEKLNQSNF QKMYQMFNET TSQVRKYQQN 840 

MSHLEEKLLL TTKISKNFET RLQDIESKVT QTLIPYYISV KKGSWTNER DQALQLQVLN 900 

SRFKALEAKS IHLSINFFSL NKTLHEVLTM CHNASTSVSE LNATIPKWIK HSLPDIQIiLQ 960 

KGLTEFVEPI IQIKTQAALS NSTCCIDRSL PGSLANWKS QKQVKSLPKK INALKKPTVN 1020 

LTTVLIGRTQ RNTDNIIYPE EYSSCSRHPC QNGGTCINGR TSFTCACRHP FTGDNCTIKL 1080 

VEENALAPDF SKGSYRYAPM VAFFASHTYG MTIPGPILFN NLDVNYGASY TPRTGKFRIP 1140 

YliGVYVFKYT IESFSAHISG FLWDGIDKL AFESENINSE IHCDRVLTGD ALLELNYGQE 1200 
VWUUjAKGTI PAKFPPVTTF SGYLLYRT 



Seq ID NO: 102 DNA sequence 

Nucleic Acid Accession NM_000873.2 

Coding sequence: 57-884 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I II I 1 I 

ATCTCCCTCC AGGCAGCCCT TGGCTGGTCC CTGCGAGCCC GTGGAGACTG CCAGAGATGT 60 

CCTCTTTCGG TTACAGGACC CTGACTGTGG CCCTCTTCAC CCTGATCTGC TGTCCAGGAT 120 

CGGATGAGAA GGTATTCGAG GTACACGTGA GGCCAAAGAA GCTGGCGGTT GAGCCCAAAG 180 

GGTCCCTCGA GGTCAACTGC AGCACCACCT GTAACCAGCC TGAAGTGGGT GGTCTGGAGA 240 

CCTCTCTAAA TAAGATTCTG CTGGACGAAC AGGCTCAGTG GAAACATTAC TTGGTCTCAA 300 

ACATCTCCCA TGACACGGTC CTCCAATGCC ACTTCACCTG CTCCGGGAAG CAGGAGTCAA 360 

TGAATTCCAA CGTCAGCGTG TACCAGCCTC CAAGGCAGGT CATCCTGACA CTGCAACCCA 420 

CTTTGGTGGC TGTGGGCAAG TCCTTCACCA TTGAGTGCAG GGTGCCCACC GTGGAGCCCC 480 

TGGACAGCCT CACCCTCTTC CTGTTCCGTG GCAATGAGAC TCTGCACTAT GAGACCTTCG 540 

GGAAGGCAGC CCCTGCTCCG CAGGkGGCCA CAGCCACATT CAACAGCACG GCTGACAGAG 600 

AGGATGGCCA CCGCAACTTC TCCTGCCTGG CTGTGCTGGA CTTGATGTCT CGCGGTGGCA 660 

ACATCTTTCA CAAACACTCA GCCCCGAAGA TGTTGGAGAT CTATGAGCCT GTGTCGGACA 720 
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GCCAGATGGT CATCATAGTC ACGGTGGTGT CGGTGTTGCT GTCCCTGTTC GTGACATCTG 780 
TCCTGCTCTG CTTCATCTTC GGCCAGCACT TGCGCCAGCA GCGGATGGGC ACCTACGGGG 840 
TGGGAGCGGC TTGGAGGAGG CTGCCCCAGG CCTTCCGGCC ATAGCAACCA TGAGTGGCAT 900 
GGCCACCACC ACGGTGGTCA CTGGAACTCA GTGTGACTCC TCAGGGTTGA GGTCCAGCCC 960 
TGGCTGAAGG ACTGTGACAG GCAGCAGAGA CTTGGGACAT TGCCTTTTCT AGCCCGAATA 1020 
CAAACACCTG GACTT 

Seq ID No: 103 Protein sequence: 
Protein Accession #: NP_000864.1 



1 11 21 31 41 51 

I I I I I I 

MSSFGYRTLT VALFTLICCP GSDEKVFBVH VRPKKLAVEP KGSLEVNCST TCNQPEVGGL 60 

ETSLNKILLD EQAQWKHYLV SNISHDTVLQ CHFTCSGKQB SMNSNVSVYQ PPRQVILTLQ 120 

PTLVAVGKSF TIECRVPTVE PLDSLTLFLF RGNETLHYET FGKAAPAPQE ATATFNSTAD 180 

REDGHRNFSC LAVLDLMSRG GNIFHKHSAP KMLEIYEPVS DSQMVIIVTV VSVLLSLFVT 240 
SVLLCFIFGQ HLRQQRMGTY GVRAAWRRLP QAFRP 

Seq ID NO: 104 DNA sequence 

Nucleic Acid Accession #: NM_001795.2 

Coding sequence: 121-2475 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GACGGTCGGC TGACAGGCTC CACAGAGCTC CACTCACGCT CAGGCCCTGG ACGGACAGGC 60 

AGTCCAACGG AACAGAAACA TCCCTCAGCC CCACAGGCAC GATCTGTTCC TCCTGGGAAG 120 

ATGCAGAGGC TCATGATGCT CCTCGCCACA TCGGGCGCCT GCCTGGGCCT GCTGGCAGTG 180 

GCAGCAGTGG CAGCAGCAGG TGCTAACCCT GCCCAACGGG ACACCCACAG CCTGCTGCCC 240 

ACCCACCGGC GCCAAAAGAG AGATTGGATT TGGAACCAGA TGCACATTGA TGAAGAGAAA 300 

AACACCTCAC TTCCCCATCA TGTAGGCAAG ATCAAGTCAA GCGTGAGTCG CAAGAATGCC 360 

AAGTACCTGC TCAAAGGAGA ATATGTGGGC AAGGTCTTCC GGGTCGATGC AGAGACAGGA 420 

GACGTGTTCG CCATTGAGAG GCTGGACCGG GAGAATATCT CAGAGTACCA CCTCACTGCT -480 

GTCATTGTGG ACAAGGACAC TGGTGAAAAC CTGGAGACTC CTTCCAGCTT CACCATCAAA 540 

GTTCATGACG TGAACGACAA CTGGCCTGTG TTCACGCATC GGTTGTTCAA TGCGTCCGTG 600 

CCTGAGTCGT CGGCTGTGGG GACCTCAGTC ATCTCTGTGA CAGCAGTGGA TGCAGACGAC 660 

CCCACTGTGG GAGACCACGC CTCTGTCATG TACCAAATCC TGAAGGGGAA AGAGTATTTT 720 

GCCATCGATA ATTCTGGACG TATTATCACA ATAACGAAAA GCTTGGACCG AGAGAAGCAG 780 

GCCAGGTATG AGATCGTGGT GGAAGCGCGA GATGCCCAGG GCCTCCGGGG GGACTCGGGC 840 

ACGGCCACCG TGCTGGTCAC TCTGCAAGAC ATCAATGACA ACTTCCCCTT CTTCACCCAG 900 

ACCAAGTACA CATTTGTCGT GCCTGAAGAC ACCCGTGTGG GCACCTCTGT GGGCTCTCTG 960 

TTTGTTGAGG ACCCAGATGA GCCCCAGAAC CGGATGACCA AGTACAGCAT CTTGCGGGGC 1020 - 

GACTACCAGG ACGCTTTCAC CATTGAGACA AACCCCGCCC ACAACGAGGG CATCATCAAG 1080 

CCCATGAAGC CTCTGGATTA TGAATACATC CAGCAATACA GCTTCATCGT CGAGGCCACA 1140 

GACCCCACCA TCGACCTCCG ATACATGAGC CCTCCCGCGG GAAACAGAGC CCAGGTCATT 1200 

ATCAACATCA CAGATGTGGA CGAGCCCCCC ATTTTCCAGC AGCCTTTCTA CCACTTCCAG 1260 

CTGAAGGAAA ACCAGAAGAA GCCTCTGATT GGCACAGTGC TGGCCATGGA CCCTGATGCG 1320 

GCTAGGCATA GCATTGGATA CTCCATCCGC AGGACCAGTG ACAAGGGCCA GTTCTTCCGA 1380 

GTCACAAAAA AGGGGGACAT TTACAATGAG AAAGAACTGG ACAGAGAAGT CTACCCCTGG 1440 

TATAACCTGA CTGTGGAGGC CAAAGAACTG GATTCCACTG GAACCCCCAC AGGAAAAGAA 1500 

TCCATTGTGC AAGTCCACAT TGAAGTTTTG GATGAGAATG ACAATGCCCC GGAGTTTGCC 1560 

AAGCCCTACC AGCCCAAAGT GTGTGAGAAC GCTGTCCATG GCCAGCTGGT CCTGCAGATC 1620 

TCCGCAATAG ACAAGGACAT AACACCACGA AACGTGAAGT TCAAATTCAC CTTGAATACT 1680 

GAGAACAACT TTACCCTCAC GGATAATCAC GATAACACGG CCAACATCAC AGTCAAGTAT 1740 

GGGCAGTTTG ACCGGGAGCA TACCAAGGTC CACTTCCTAC CCGTGGTCAT CTCAGACAAT 1800 

GGGATGCCAA GTCGCACGGG CACCAGCACG CTGACCGTGG CCGTGTGCAA GTGCAACGAG 1860 

CAGGGCGAGT TCACCTTCTG CGAGGATATG GCCGCCCAGG TGGGCGTGAG CATCCAGGCA 1920 

GTGGTAGCCA TCTTACTCTG CATCCTCACC ATCACAGTGA TCACCCTGCT CATCTTCCTG 1980 

CGGCGGCGGC TCCGGAAGCA GGCCCGCGCG CACGGCAAGA GCGTGCCGGA GATCCACGAG 2040 

CAGCTGGTCA CCTACGACGA GGAGGGCGGC GGCGAGATGG ACACCACCAG CTACGATGTG 2100 

TCGGTGCTCA ACTCGGTGCG CCGCGGCGGG GCCAAGCCCC CGCGGCCCGC GCTGGACGCC 2160 

CGGCCTTCCC TCTATGCGCA GGTGCAGAAG CCACCGAGGC ACGCGCCTGG GGCACACGGA 2220 

GGGCCCGGGG AGATGGCAGC CATGATCGAG GTGAAGAAGG ACGAGGCGGA CCACGACGGC 2280 

GACGGCCCCC CCTACGACAC GCTGCACATC TACGGCTACG AGGGCTCCGA GTCCATAGCC 2340 

GAGTCCCTCA GCTCCCTGGG CACCGACTCA TCCGACTCTG ACGTGGATTA CGACTTCCTT 2400 

AACGACTGGG GACCCAGGTT TAAGATGCTG GCTGAGCTGT ACGGCTCGGA CCCCCGGGAG 2460 

GAGCTGCTGT ATTAGGCGGC CGAGGTCACT CTGGGCCTGG, GGACCCAAAC CCCCTGCAGC 2520 

CCAGGCCAGT CAGACGCCAG GCACCACAGC CTCCAAAAAT GGCAGTGACT CCCCAGCCCA 2580 

GCACCCCTTC CTCGTGGGTC CCAGAGACCT CATCAGCCTT GGGATAGCAA ACTCCAGGTT 2640 

CCTGAAATAT CCAGGAATAT ATGTCAGTGA TGACTATTCT CAAATGCTGG CAAATCCAGG 2700 

CTGGTGTTCT- GTCTGGGCTC AGACATCCAC ATAACCCTGT CACCCACAGA CCGCCGTCTA 2760 

ACTCAAAGAC TTCCTCTGGC TCCCCAAGGC TGCAAAGCAA AACAGACTGT GTTTAACTGC 2820 

TGCAGGGTCT TTTTCTAGGG TCCCTGAACG CCCTGGTAAG GCTGGTGAGG TCCTGGTGCC 2880 

TATCTGCCTG GAGGCAAAGG CCTGGACAGC TTGACTTGTG GGGCAGGATT CTCTGCAGCC 2940 

CATTCCCAAG GGAGACTGAC CATCATGCCC TCTCTCGGGA GCCCTAGCCC TGCTCCAACT 3000 
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CCATACTCCA CTCCAAGTGC CCCACCACTC CCCAACCCCT CTCCAGGCCT GTCAAGAGGG 3060 

AGGAAGGGGC CCCATGGCAG CTCCTGACCT TGGGTCCTGA AGTGACCTCA CTGGCCTGCC 3120 

ATGCCAGTAA CTGTGCTGTA CTGAGCACTG AACCACATTC AGGGAAATGG CTTATTAAAC 3180 

TTTGAAGCAA CTGTGAATTC ATTCTGGAGG GGCAGTGGAG ATCAGGAGTG ACAGATCACA 3240 

GGGTGAGGGC CACCTCCACA CCCACCCCCT CTGGAGAAGG CCTGGAAGAG CTGAGACCTT 3300 

GCTTTGAGAC TCCTCAGCAC CCCTCCAGTT TTGCCTGAGA AGGGGCAGAT GTTCCCGGAG 3360 

CAGAAGACGT CTCCCCTTCT CTGCCTCACC TGGTCGCCAA TCCATGCTCT CTTTCTTTTC 3420 

TCTGTCTACT CCTTATCCCT TGGTTTAGAG GAACCCAAGA TGTGGCCTTT AGCAAAACTG 3480 

GACAATGTCC AAACCCACTC ATGACTGCAT GACGGAGCCG AGCCATGTGT CTTTACACCT 3540 

CGCTGTTGTC ACATCTCAGG GAACTGACCC TCAGGCACAC CTTGCAGAAG GCAAGGCCCT 3600 

GCCCTGCCCA ACCTCTGTGG TCACCCATGC ATCTTCCACT GGAACGTTTC ACTGCAAACA 3660 

CACCTTGGAG AAGTGGCATC AGTCAACAGA GAGGGGCAGG GAAGGAGACA CCAAGCTCAC 3720 

CCTTCGTCAT GGACCGAGGT TCCCACTCTG GGCAAAGCCC CTCACACTGC AAGGGATTGT 3780 

AGATAACACT GACTTGTTTG TTTTAACCAA TAACTAGCTT CTTATAATGA TTTTTTTACT 3B40 

AATGATACTT ACAAGTTTCT AGCTCTCACA GACATATAGA ATAAGGGTTT TTGCATAATA 3900 

AGCAGGTTGT TATTTAGGTT AACAATATTA ATTCAGGTTT TTTAGTTGGA AAAACAATTC 3960 

CTGTAACCTT CTATTTTCTA TAATTGTAGT AATTGCTCTA CAGATAATGT CTATATATTG 4020 

GCCAAACTGG TGCATGACAA GTACTGTATT TTTTTATACC TAAATAAAGA AAAATCTTTA 4080 
GCCTGGGCAA CAAAAAAA 

Seq ID No: 105 Protein sequence: 
Protein Accession #: NP_001786.1 



1 11 21 31 41 51 

I I I I I I 

MQRfcMMLLAT SGACLGLIiAV AAVAAAGANP AQRDTHSLLP THRRQKRDWI WNQMHIDEEK 60 

NTSLPHHVGK IKSSVSRKNA KYLLKGEYVG KVFRVDAETG DVFAIERLDR ENISEYHLTA 120 

VIVDKDTGEN LETPSSFTIK VHDVNDNWPV FTHRLFNASV PESSAVGTSV ISVTAVDADD 180 

PTVGDHASVM YQILKGKEYF AIDNSGRIIT ITKSLDREKQ ARYEIWEAR DAQGLRGDSG 240 

TATVLVTLQD INDNFPFFTQ TKYTFWPED TRVGTSVGSL FVEDPDEPQN RMTKYSILRG 300 

DYQDAFTIET NPAHNEGIIK PMKPLDYEYI QQYSFIVEAT DPTIDLRYMS PPAGNRAQVI 360 

INITDVDEPP IFQQPFYHFQ UCENQKKPLI GTVIAMDPDA ARHSIGYSIR RTSDKGQFFR 420 

VTKKGDIYNE KELDREVYPW YNLTVEAKEL DSTGTPTGKE SIVQVHIEVL DENDNAPEFA 480 

KPYQPKVCEN AVHGQLVLQI SAIDKDITPR NVKFKFTLNT ENNFTLTDNH DNTANITVKY 540 

GQFDREHTKV HFLPWISDN GMPSRTGTST LTVAVCKCNE QGEFTFCEDM AAQVGVSIQA 600 

WAILLCILT ITVITLLIFL RRRLRKQARA HGKSVPEIHE QLVTYDEEGG GEMDTTSYDV 660 

SVLNSVRRGG AKPPRPALDA RPSLYAQVQK PPRHAPGAHG GPGEMAAMIE VKKDEADHDG 720 

DGPPYDTLHI YGYEGSESIA ESLSSLGTDS SDSDVDYDFL NDWGPRFKML AELYGSDPRE 780 
ELLY 

Seq ID NO: 106 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-474 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I 1 

ACAG TACTCT GTGCAAAAAA CCTGGTGAAA AAGGATTTTT TCCGACTTCC TGATCCATTT 60 

GCTAAGGTGG TGGTTGATGG ATCTGGGCAA TGCCATTCTA CAGATACTGT GAAGAATACG 120 

CTTGATCCAA AGTGGAATCA GCATTATGAC CTGTATATTG GAAAGTCTGA TTCAGTTACG 180 

ATCAGTGTAT GGAATCACAA GAAGATCCAT AAGAAACAAG GTGCTGGATT TCTCGGTTGT 240 

GTTCGTCTTC TTTCCAATGC CATCAACCGC CTCAAAGACA CTGGTTATCA GAGGTTGGAT 300 

TTATGCAAAC TCGGGCCAAA TGACAATGAT ACAGTTAGAG GACAGATAGT AGTAAGTCTT 360 

CAGTCCAGAG ACCGAATAGG CACAGGAGGA CAAGTTGTGG ACTGCAGTCG TTTATTTGAT 420 

AACGATTTAC CAGACGGAGC TCATTATTTG TGGACTTGGA AAGATAGATG T TAAT GACTG 480 

GAAGGTAAAC ACCCGGTTAA AACACTGTAC ACCAGACAGC AACATTGTCA AATGGTTCTG 540 

GAAAGCTGTG GAGTTTTTTG ATGAAGAGCG ACGAGCAAGA TTGCTTCAGT TTGTGACAGG 600 

ATCCTCTCGA GTGCCTCTGC AGGGCTTCAA AGCATTGCAA GGTGCTGCAG GCCCGAGACT 660 

CTTTACCATA CACCAGATTG ATGCCTGCAC TAACAACCTG CCGAAAGCCC ACACTTGCTT 720 

CAATCGAATA GACATTCCAC CCTATGAAAG CTATGAAAAG CTATATGAAA AGCTGCTAAC 780 

AGCCATTGAA GAAACATGTG GATTTGCTGT GGAATGACAA GCTTCAAGGA TTTACCCAGG 840 
AC 

Seq ID No: 107 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

TVLCAKNLVK KDFFRLPDPF AKVWDGSGQ CHSTDTVKNT LDPKWNQHYD LYIGKSDSVT 60 

ISVWNHKKIH KKQGAGFLGC VRLLSNAINR LKDTGYQRLD LCKLGPNDND TVRGQIWSL 120 
QSRDRIGTGG QWDCSRLFD NDLPDGAHYL WTWKDRC 

Seq ID NO: 108 DNA sequence 

Nucleic Acid AcceBBion #: NMJJ02318.1 

Coding sequence: 248-2572 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

I I I I J I 

ACTCCAGCGC GCGGCTACCT ACGCTTGGTG CTTGCTTTCT CCAGCCATCG GAGACCAGAG 60 

CCGCCCCCTC TGCTCGAGAA AGGGGCTCAG CGGCGGCGGA AGCGGAGGGG GACCACCGTG 120 

GAGAGCGCGG TCCCAGCCCG GCCACTGCGG ATCCCTGAAA CCAAAAAGCT CCTGCTGCTT 180 

CTGTACCCCG CCTGTCCCTC CCAGCTGCGC AGGGCCCCTT CGTGGGATCA TCAGCCCGAA 240 

GACAGGGATG GAGAGGCCTC TGTGCTCCCA CCTCTGCAGC TGCCTGGCTA TGCTGGCCCT 300 

CCTGTCCCCC CTGAGCCTGG CACAGTATGA CAGCTGGCCC CATTACCCCG AGTACTTCCA 360 

GCAACCGGCT CCTGAGTATC ACCAGCCCCA GGCCCCCGCC AACGTGGCCA AGATTCAGCT 420 

GCGCCTGGCT GGGCAGAAGA GGAAGCACAG CGAGGGCCGG GTGGAGGTGT ACTATGATGG 480 

CCAGTGGGGC ACCGTGTGCG ATGACGACTT CTCCATCCAC GCTGCCCACG TCGTCTGCCG 540 

GGAGCTGGGC TATGTGGAGG CCAAGTCCTG GACTGCCAGC TCCTCCTACG GCAAGGGAGA 600 

AGGGCCCATC TGGTTAGACA ATCTCCACTG TACTGGCAAC GAGGCGACCC TTGCAGCATG 660 

CACCTCCAAT GGCTGGGGCG TCACTGACTG CAAGCACACG GAGGATGTCG GTGTGGTGTG 720 

CAGCGACAAA AGGATTCCTG GGTTCAAATT TGACAATTCG TTGATCAACC AGATAGAGAA 780 

CCTGAATATC CAGGTGGAGG ACATTCGGAT TCGAGCCATC CTCTCAACCT ACCGCAAGCG 840 

CACCCCAGTG ATGGAGGGCT ACGTGGAGGT GAAGGAGGGC AAGACCTGGA AGCAGATCTG 900 

TGACAAGCAC TGGACGGCCA AGAATTCCCG CGTGGTCTGC GGCATGTTTG GCTTCCCTGG 960 

GGAGAGGACA TACAATACCA AAGTGTACAA AATGTTTGCC TCACGGAGGA AGCAGCGCTA 1020 

CTGGCCATTC TCCATGGACT GCACCGGCAC AGAGGCCCAC ATCTCCAGCT GCAAGCTGGG 1080 

CCCCCAGGTG TCACTGGACC CCATGAAGAA TGTCACCTGC GAGAATGGGC TGCCGGCCGT 1140 

GGTGAGTTGT GTGCCTGGGC AGGTCTTCAG CCCTGACGGA CCCTCGAGAT TCCGGAAAGC 1200 

ATACAAGCCA GAGCAACCCC TGGTGCGACT GAGAGGCGGT GCCTACATCG GGGAGGGCCG 1260 

CGTGGAGGTG CTCAAAAATG GAGAATGGGG GACCGTCTGC GACGACAAGT GGGACCTGGT 1320 

GTCGGCCAGT GTGGTCTGCA GAGAGCTGGG CTTTGGGAGT GCCAAAGAGG CAGTCACTGG 1380 

CTCCCGACTG GGGCAAGGGA TCGGACCCAT CCACCTCAAC GAGATCCAGT GCACAGGCAA 1440 

TGAGAAGTCC ATTATAGACT GCAAGTTCAA TGCCGAGTCT CAGGGCTGCA ACCACGAGGA 1500 

GGATGCTGGT GTGAGATGCA ACACCCCTGC CATGGGCTTG CAGAAGAAGC TGCGCCTGAA 1560 

CGGCGGCCGC AATCCCTACG AGGGCCGAGT GGAGGTGCTG GTGGAGAGAA ACGGGTCCCT 1620 

TGTGTGGGGG ATGGTGTGTG GCCAAAACTG GGGCATCGTG GAGGCCATGG TGGTCTGCCG 1680 

CCAGCTGGGC CTGGGATTCG CCAGCAACGC CTTCCAGGAG ACCTGGTATT GGCACGGAGA 174 0 

TGTCAACAGC AACAAAGTGG TCATGAGTGG AGTGAAGTGC TCGGGAACGG AGCTGTCCCT 1800 

GGCGCACTGC CGCCACGACG GGGAGGACGT GGCCTGCCCC CAGGGCGGAG TGCAGTACGG 1860 

GGCCGGAGTT GCCTGCTCAG AAACCGCCCC TGACCTGGTC CTCAATGCGG AGATGGTGCA 1920 

GCAGACCACC TACCTGGAGG ACCGGCCCAT GTTCATGCTG CAGTGTGCCA TGGAGGAGAA 1980 

CTGCCTCTCG GCCTCAGCCG CGCAGACCGA CCCCACCACG GGCTACCGCC GGCTCCTGCG 2040 

CTTCTCCTCC CAGATCCACA ACAATGGCCA GTCCGACTTC CGGCCCAAGA ACGGCCGCCA 2100 

CGCGTGGATC TGGCACGACT GTCACAGGCA CTACCACAGC ATGGAGGTGT TCACCCACTA 2160 

TGACCTGCTG AACCTCAATG GCACCAAGGT GGCAGAGGGC CACAAGGCCA GCTTCTGCTT 2220 

GGAGGACACA GAATGTGAAG GAGACATCCA GAAGAATTAC GAGTGTGCCA ACTTCGGCGA 2280 

TCAGGGCATC ACCATGGGCT GCTGGGACAT GTACCGCCAT GACATCGACT GCCAGTGGGT 2340 

TGACATCACT GACGTGCCCC CTGGAGACTA CCTGTTCCAG GTTGTTATTA ACCCCAACTT 2400 

CGAGGTTGCA GAATCCGATT ACTCCAACAA CATCATGAAA TGCAGGAGCC GCTATGACGG 2460 

CCACCGCATC TGGATGTACA ACTGCCACAT AGGTGGTTCC TTCAGCGAAG AGACGGAAAA 2520 

AAAGTTTGAG CACTTCAGCG GGCTCTTAAA CAACCAGCTG TCCCCGCA GT AA AGAAGCCT 2580 

GCGTGGTCAA CTCCTGTCTT CAGGCCACAC CACATCTTCC ATGGGACTTC CCCCCAACAA 2640 

CTGAGTCTGA ACGAATGCCA CGTGCCCTCA CCCAGCCCGG CCCCCACCCT GTCCAGACCC 2700 

CTACAGCTGT GTCTAAGCTC AGGAGGAAAG GGACCCTCCC ATCATTCATG GGGGGCTGCT 2760 

ACCTGACCCT TGGGGCCTGA GAAGGCCTTG GGGGGGTGGG GTTTGTCCAC AGAGCTGCTG 2820 

GAGCAGCACC AAGAGCCAGT CTTGACCGGG ATGAGGCCCA CAGACAGGTT GTCATCAGCT 2880 

TGTCCCATTC AAGCCACCGA GCTCACCACA GACACAGTGG AGCCGCGCTC TTCTCCAGTG 2940 

ACACGTGGAC AAATGCGGGC TCATCAGCCC CCCCAGAGAG GGTCAGGCCG AACCCCATTT 3000 

CTCCTCCTCT TAGGTCATTT TCAGCAAACT TGAATATCTA GACCTCTCTT CCAATGAAAC 3060 

CCTCCAGTCT ATTATAGTCA CATAGATAAT GGTGCCACGT GTTTTCTGAT TTGGTGAGCT 3120 

CAGACTTGGT GCTTCCCTCT CCACAACCCC CACCCCTTGT TTTTCAAGAT ACTATTATTA 3180 

TATTTTCACA GACTTTTGAA GCACAAATTT ATTGGCATTT AATATTGGAC ATCTGGGCCC 3240 

TTGGAAGTAC AAATCTAAGG AAAAACCAAC CCACTGTGTA AGTGACTCAT CTTCCTGTTG 3300 

TTCCAATTCT GTGGGTTTTT GATTCAACGG TGCTATAACC AGGGTCCTGG GTGACAGGGC 3360 

GCTCACTGAG CACCATGTGT CATCACAGAC ACTTACACAT ACTTGAAACT TGGAATAAAA 3420 
GAAAGATTTA TG 

Seq ID No: 109 Protein sequence: 
Protein Accession #: NP 002309.1 



1 11 21 31 41 51 

I I I I I i 

MERPLCSHLC SCIAMLALLS PLSLAQYDSW PHYPEYFQQP APEYHQPQAP ANVAKIQLRL 60 

AGQKRKHSEG RVEVYYDGQW GTVCDDDFSI HAAHWCREL GYVEAKSWTA SSSYGKGEGP 120 

IWLDNIiHCTG NEATLAACTS NGWGVTDCKH TEDVGWCSD KRIPGPKFDN SLINQIENLN 180 

IQVEDIRIRA ILSTYRKRTP VMEGYVEVKE GKTWKQICDK HWTAKNSRW CGMFGFPGER 240 

TYNTKVYKMF ASRRKQRYWP FSMDCTGTEA HISSCKLGPQ VSLDPMKNVT C ENGL PAWS 300 

CVPGQVFSPD GPSRFRKAYK PEQPLVRLRG GAYIGEGRVE VLKNGEWGTV CDDKWDLVSA 360 

SWCRELGFG SAKEAVTGSR LGQGIGPIHL NEIQCTGNEK SIIDCKFNAE SQGCNHEEDA 420 



231 



WO 02/079492 



PCI7US02/04915 



GVRCNTPAMG 
GLGFASNAFQ 
VACSETAPDL 
SQIHNNGQSD 
TBCEGDIQKN 
AESDYSNNIM 



LQKKLRHNGG 
BTWYWHGDVN 
VLNAEMVQQT 
FRPKNGRHAW 
YECANFGDQG 
KCRSRYDGHR 



RNPYEGRVEV 
SNKWMSGVK 
TYLEDRPMFM 
IWHDCHRHYH 
ITMGCWDMYR 
IWMYNCHIGG 



LVERNGSLVW 
CSGTELSLAH 
LQCAMEENCL 
SMEVFTHYDL 
HDIDCQWVDI 
SFSEETEKKP 



GMVCGQNWGI 
CRHDGEDVAC 
SASAAQTDPT 
LNLNGTKVAE 
TDVPPGDYLF 
EHFSGLLNNQ 



VEAMWCRQL 
PQGGVQYGAG 
TGYRRLIiRFS 
GHKASFCLED 
QWINPNFEV 
LSPQ 



480 
540 
600 
660 
720 



10 



15 



20 



25 



30 



Seq ID NO: 110 DNA sequence 

Nucleic Acid Accession #: none found, CAT_73007_3 

Coding sequence: 1-495 (underlined sequences correspond to start and stop codons) 



1 
I 

CGGACGCGTG 
GCTGCTGCCT 
CTGGGTGATC 
ACCTCTAGCT 
ACCCAGGATA 
CCCAGGCGCA 
ACTGTTCCCC 
CCAGCCCCAC 
GCCCACTCTG 
CACTTTGCTC 
GTGGTGATCA 
TCTGAAGATC 
GCGAGAACGA 
GCACTCAGCA 
GGGC 



11 

I. 

GGTCGACCCA 
GCCCGCCACA 
CTGGGCTTCC 
CTCAGGGAGG 
CATCCCTTCC 
GGTGTCCCCA 
CCAATTCAAC 
GTCAGAGACT 
GCTAGGGGGC 
CTGGAGCCTG 
TCCTAGTTGG 
CCAGAACCTG 
CAGCATCGCC 
GCGAGCATCA 



21 
I 

CGCGTCCGCC 
TACCCAGCTG 
TCCTGTTCCG 
CCTTGGCGGT 
TCAGAGGCTA 
GCAGTGGAAG 
CACCATGAGC 
GTGCTCACTG 
GGCAGGCTGA 
TCTACCACAG 
TGTGGTCAGC 
GGAGTACAGG 
CATTATCACA 
AAGGAGCCTA 



31 
I 

CACGCGTCCG 
ACATGGGCAC 
AGGCCACAAC 
CTAAGTCTGA 
ACAGGCCAAG 
AGACGGAGGC 
CTGAGCATGA 
TGGCTGCATT 
GAGCTCACCT 
TGTTATCAGC 
CTGAGGGTTC 
GCGTGTCTGA 
TGGAAGACAT 
CGCATGGCCC 



41 

I 

TATGGACAGA 
CGCAGGAGCC 
TCCCAGCCCA 
CCACAGAGCC 
CCATCTGTCC 
ACAAGCAGAG 
GGGAAGATGC 
TGGGATGGAG 
GTTCAGCAGA 
TTCATTGTCA 
AGTGTCGGAA 
CAAGCTGGTC 
CACACGACTT 
AGACTGAGAG 



51 
) 

GCCTCCACTG 
ATGCAGCTGT 
CAATGACCCA 
AGTTTCTTCC 
AGCACTGGTA 
ACACATTTCA 
GACCATCCTG 
TCGGGTGGAG 
GAAGTGGAAC 
TCCTGGTGGT 
GAGCAAGGAG 
ACAGACCATG 
AGGGCAACAC 
CAAGCACAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



Seq ID No: 111 Protein sequence: 

Protein Accession #: none found, CAT_73007_3 
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1 11 21 31 41 51 

I I I I 1 I 

RTRGSTHASA HASVWTEPPL AAACPPHTQL TWAPQEPCSC LGDPGLPPVP RPQLPAHNDP 
TSSSQGGLGG LSLTTEPVSS TQDTSLPQRL TGQAICPALV PRRRCPQQWK RRRHKQRHIS 
TVPPNSTTMS LSMREDATIL PAPRQRLCSL WLHLGWSRVE AHSG 



60 
120 



Seq ID NO: 112 DNA sequence 

Nucleic Acid Accession #: NM_005424.1 

Coding sequence: 37-3453 {underlined sequences correspond to start and stop codons) 



CGCTCGTCCT 
TTGCTCCCCA 
GCCAACCTGC 
GGGGCGGGGA 
ATCGTGCGCA 
ACGCTTCGCG 
GCTGGGGCGC 
CCAGACAAGG 
CACAAGGAGA 
GACTGGCATG 
TCGAGCGGCA 
CGGCTCATCG 
CCAGGTTGCC 
GGCTTCACTG 
CAGGAGCAGT 
TATGGCTGCT 
GGTCATTTTG 
CGGTTCAGTG 
CGGATCCCCC 
CGGATCAACT 
AAGCCAGACG 
GCTGAGTTCG 
TCCACATCTG 
CCCCTGGCTG 
GTCTCGTTCT 
AGTACCATGG 
CTGAGGCCAA 
GAGGGGGCCT 
CCGTGGTTGG 
CCCTTGGTGC 



11 
I 

GGCTGGCCTG 
TCCTCTTCTT 
GGCTCACGGA 
GGGGCTCGGA 
CCCCGCCCGG 
GCTTCTCCAA 
GGCGCACGCG 
TCACACACAC 
AGCAGACAGA 
AAGCCCAGGA 
TCTACAGTGC 
TGCGGGGTTG 
TACATGGAGG 
GCACCCGCTG 
GCCCAGGCAT 
CTTGTGGATC 
GGGCTGATTG 
GTTGTGTCTG 
AGATCCTCAA 
GTGCAGCTGC 
GCACTGTGCT 
AGGTGCCCCG 
GCGGCCAAGA 
CACCTCGGCT 
CTGGGGATGG 
ACTGGTCGAC 
AGACAGGATA 
GGGGGCCTCC 
AGGGCTGGCA 
CCGGGCCACT 



21 

I 

GGTCGGCCTC 
GGCTTCTCAT 
CCCCCAGCGC 
CGCCTGGGGC 
GCCACCCCTG 
GCCCTCGGAC 
CGTCATCTAC 
TGTGAACAAA 
CGTGATCTGG 
TGGGCGGTTC 
CACTTACCTG 
TGGGGCTGGG 
TGTCTGCCAC 
TGAACAGGCC 
ATCAGGCTGC 
TGGCTGGAGA 
CCGACTCCAG 
CCCCTCTGGG 
CATGGCCTCA 
AGGGAACCCC 
CCTGTCCACC 
CTTGGTTCTT 
CAGCCGGCGC 
CCTGACCAAG 
ACCCATCTCC 
CATTGTGGTG 
CAGTGTTCGT 
CACCCTCATG 
TGTGGAAGGC 
GGTGGGCGAC 



31 
I 

TGGAGTATGG 
GTGGGCGCGG 
TTCTTCCTGA 
CCGCCCCTGC 
CGCCTGGCGC 
CTCGTGGGCG 
GTGCACAACA 
GGTGACACCG 
AAGAGCAACG 
CTGCTGCAGC 
GAAGCCAGCC 
CGCTGGGGGC 
GACCATGACG 
TGCAGAGAGG 
CGGGGCCTCA 
GGAAGCCAGT 
TGCCAGTGTC 
TGGCATGGAG 
GAACTGGAGT 
TTCCCCGTGC 
AAGGCCATTG 
GCGGACAGTG 
TTCAAGGTCA 
CAGAGCCGCC 
ACTGTCCGCC 
GACCCCAGTG 
GTGCAGCTGA 
ACCACAGACT 
ACTGACCGGC 
GGTTTCCTGC 



41 
I 

TCTGGCGGGT 
CGGTGGACCT 
CTTGCGTGTC 
TGCTGGAGAA 
GCAACGGTTC 
TCTTCTCCTG 
GCCCTGGAGC 
CTGTACTTTC 
GATCCTACTT 
TCCCAAATGT 
CCCTGGGCAG 
CAGGCTGTAC 
GCGAATGTGT 
GCCGTTTTGG 
CCTTCTGCCT 
GCCAAGAAGC 
AGAATGGTGG 
TGCACTGTGA 
TCAACTTAGA 
GGGGCAGCAT 
TGGAGCCAGA 
GGTTCTGGGA 
ATGTGAAAGT 
AGCTTGTGGT 
TGCACTACCG 
AGAACGTGAC 
GCCGGCCAGG 
GTCCTGAGCC 
TGCGAGTGAG 
TGCGCCTGTG 



51 
I 

GCCCCCTTTC 
GACGCTGCTG 
TGGGGAGGCC 
GGACGACCGT 
GCACCAGGTC 
CGTGGGCGGT 
CCACCTGCTT 
TGCACGTGTG 
CTACACCCTG 
GCAGCCACCA 
CGCCTTCTTT 
CAAGGAGTGC 
ATGCCCCCCT 
GCAGAGCTGC 
CCCAGACCCC 
TTGTGCCCCT 
CACTTGTGAC 
GAAGTCAGAC 
GACGATGCCC 
AGAGCTACGC 
GAAGACCACA 
GTGCCGTGTG 
GCCCCCCGTG 
CTCCCCGCTG 
GCCCCAGGAC 
GTTAATGAAC 
GGAAGGAGGA 
TTTGTTGCAG 
CTGGTCCTTG 
GGACGGGACA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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WO 02/079492 
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CGGGGGCAGG AGCGGCGGGA GAACGTCTCA 
GGACTCACGC CTGGCACCCA CTACCAGCTG 
GGCCCGGCCT CGCCCCCTGC ACACGTGCTT 
CACCTCCACG CCCAGGCCCT CTCAGACTCC 
GCTCTGCCTG GGCCAATATC CAAGTACGTT 
GACCCACTGT GGATAGACGT GGACAGGCCT 
AACGCCAGCA CGCGCTACCT CTTCCGCATG 
AGCAACACAG TAGAAGAGTC CACCCTGGGC 
GAGAGCCGGG CAGCTGAAGA GGGCCTGGAT 
GTGTCTGCCA CCTGCCTCAC CATCCTGGCC 
AGCTGCCTGC ATCGGAGACG CACCTTCACC 
CTGCAGTTCA GCTCAGGGAC CTTGACACTT 
CTGAGCTACC CAGTGCTAGA GTGGGAGGAC 
AACTTCGGCC AGGTCATCCG GGCCATGATC 
ATCAAAATGC TGAAAGAGTA TGCCTCTGAA 
GAAGTTCTGT GCAAATTGGG GCATCACCCC 
AACCGAGGTT ACTTGTATAT CGCTATTGAA 
CTGCGGAAAA GCCGGGTCCT AGAGACTGAC 
TCTACCCTTA GCTCCCGGCA GCTGCTGCGT 
TACCTGAGTG AGAAGCAGTT CATCCACAGG 
GAGAACCTAG CCTCCAAGAT TGCAGACTTC 
AAGAAGACGA TGGGGCGTCT CCCTGTGCGC 
GTCTATACCA CCAAGAGTGA TGTCTGGTCC 
CTTGGAGGTA CACCCTACTG TGGCATGACC 
GGCTACCGCA TGGAGCAGCC TCGAAACTGT 
TGCTGGCGGG ACCGTCCCTA TGAGCGACCC 
CGCATGCTGG AAGCCAGGAA GGCCTATGTG 
GCGGGCATTG ATGCCACAGC TGAGGAGGCC 
GCTGGCCGGA GCAAACTCTG CTGTCTAACC 
CTTAAGCTGC CTCAAGGAAT TTTTTTAACT 
GGTGGGCTTA GGGGAACTGG GTTCCCATGC 
TCCTTCTTTC TAGTTCAGCT GCCCCACAGG 
CAAACCCCCA CTCCAGCTCC TTCGCTTAAG 
CAGCTACTCC CACTCCCGGC CTGTCATTCA 
AAAAA 



TCCCCCCAGG CCCGCACTGC CCTCCTGACG 1860 

GATGTGCAGC TCTACCACTG CACCCTCCTG 1920 

CTGCCCCCCA GTGGGCCTCC AGCCCCCCGA 1980 

GAGATCCAGC TGACATGGAA GCACCCGGAG 2040 

GTGGAGGTGC AGGTGGCTGG GGGTGCAGGA 2100 

GAGGAGACAA GCACCATCAT CCGTGGCCTC 2160 

CGGGCCAGCA TTCAGGGGCT CGGGGACTGG 2220 

AACGGGCTGC AGGCTGAGGG CCCAGTCCAA 2280 

CAGCAGCTGA TCCTGGCGGT GGTGGGCTCC 2340 

GCCCTTTTAA CCCTGGTGTG CATCCGCAGA 2400 

TACCAGTCAG GCTCGGGCGA GGAGACCATC 2460 

ACCCGGCGGC CAAAACTGCA GCCCGAGCCC 2520 

ATCACCTTTG AGGACCTCAT CGGGGAGGGG 2580 

AAGAAGGACG GGCTGAAGAT GAACGCAGCC 2640 

AATGACCATC GTGACTTTGC GGGAGAACTG 2700 

AACATCATCA ACCTCCTGGG GGCCTGTAAG 2760 

TATGCCCCCT ACGGGAACCT GCTAGATTTT 2820 

CCAGCTTTTG CTCGAGAGCA TGGGACAGCC 2880 

TTCGCCAGTG ATGCGGCCAA TGGCATGCAG 2940 

GACCTGGCTG CCCGGAATGT GCTGGTCGGA 3000 

GGCCTTTCTC GGGGAGAGGA GGTTTATGTG 3060 

TGGATGGCCA TTGAGTCCCT GAACTACAGT 3120 

TTTGGAGTCC TTCTTTGGGA GATAGTGAGC 3180 

TGTGCCGAGC TCTATGAAAA GCTGCCCCAG 3240 

GACGATGAAG TGTACGAGCT GATGCGTCAG 3300 

CCCTTTGCCC AGATTGCGCT ACAGCTAGGC 3360 

AACATGTCGC TGTTTGAGAA CTTCACTTAC 3420 

TGAG CTGCCA TCCAGCCAGA ACGTGGCTCT 3480 

TGTGACCAGT CTGACCCTTA CAGCCTCTGA 3540 

TAAGGGAGAA AAAAAGGGAT CTGGGGATGG 3600 

TTTGTAGGTG TCTCATAGCT ATCCTGGGCA 3660 

TGTGTTTCCC ATCCCACTGC TCCCCCAACA 3720 

CCAGCACTCA CACCACTAAC ATGCCCTGTT 3780 

GAAAAAAATA AATGTTCTAA TAAGCTCCAA 3840 



Seq ID No: 113 Protein sequence: 
Protein Accession #: NP_005415.1 



1 11 21 31 41 51 

I I I I I I 

MVWRVPPFLL PILFLASHVG AAVDLTLLAN LRLTDPQRFF LTCVSGEAGA GRGSDAWGPP 60 

LLLEKDDRIV RTPPGPPLRL ARNGSHQVTL RGFSKPSDLV GVFSCVGGAG ARRTRVIYVH . 120 

NSPGAHLLPD KVTHTVNKGD TAVLSARVHK EKQTDVIWKS NGSYFYTLDW HEAQDGRFLL 180 

QLPNVQPPSS GIYSATYLEA SPLGSAFFRL IVRGCGAGRW GPGCTKECPG CLHGGVCHDH 240 

DGECVCPPGF TGTRCEQACR EGRFGQSCQE QCPGISGCRG LTFCLPDPYG CSCGSGWRGS 300 

QCQEACAPGH FGADCRLQCQ CQNGGTCDRF SGCVCPSGWH GVHCEKSDRI PQILNMASEL 360 

EFNLETMPRI NCAAAGNPFP VRGSIELRKP DGTVLLSTKA IVEPEKTTAE FEVPRLVLAD 420 

SGFWECRVST SGGQDSRRFK VNVKVPPVPL AAPRLLTKQS RQLWSPLVS FSGDGPISTV 480 

RLHYRPQDST MDWSTIWDP SENVTLMNLR ' PKTGYSVRVQ LSRPGEGGEG AWGPPTLMTT 540 

DCPEPLLQPW LEGWHVEGTD RLRVSWSLPL VPGPLVGDGF LLRLWDGTRG QERRENVSSP 600 

QARTALLTGL TPGTHYQLDV QLYHCTLLGP ASPPAHVLLP PSGPPAPRHL HAQALSDSEI 660 

QLTWKHPEAL PGPISKYWE VQVAGGAGDP LWIDVDRPEE TSTIIRGIiNA STRYLFRMRA 720 

SIQGLGDWSN TVEESTLGNG LQAEGPVQES RAAEEGLDQQ LIIiAWGSVS ATCLTILAAL 780 

LTLVCIRRSC LHRRRTFTYQ SGSGEETILQ FSSGTLTLTR RPKLQPEPLS YPVLEWEDIT 840 

FEDLIGEGNF GQVTRAMIKK DGLKMNAAIK MLKEYASEND HRDFAGELEV LCKLGHHPNI 900 

INLLGACKNR GYLYIAIEYA PYGNLLDFLR KSRVLETDPA FAREHGTAST LSSRQLLRFA 960 

SDAANGMQYL SEKQFIHRDL AARNVLVGEN LASKIADFGL SRGEEVYVKK TMGRLPVRWM 1020 

AIESLNYSVY TTKSDVWSFG VLLWEIVSLG GTPYCGMTCA ELYEKLPQGY RMEQPRNCDD 1080 
EVYELMRQCW RDRPYERPPF AQIALQLGRM LEARKAYVNM SLFENFTYAG IDATAEEA 



Seq ID NO: 114 DNA sequence 

Nucleic Acid Accession #: NM_002632.1 

Coding sequence: 322-771 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGGATTCGGG CCGCCCAGCT ACGGGAGGAC CTGGAGTGGC ACTGGGCGCC CGACGGACCA 60 

TCCCCGGGAC CCGCCTGCCC CTCGGCGCCC CGCCCCGCCG GGCCGCTCCC CGTCGGGTTC 120 

CCCAGCCACA GCCTTACCTA CGGGCTCCTG ACTCCGCAAG GCTTCCAGAA GATGCTCGAA 180 

CCACCGGCCG GGGCCTCGGG GCAGCAGTGA GGGAGGCGTC CAGCCCCCCA CTCAGCTCTT 240 

CTCCTCCTGT GCCAGGGGCT CCCCGGGGGA TGAGCATGGT GGTTTTCCCT CGGAGCCCCC 300 

TGGCTCGGGA CGTCTGAGAA G ATG CCGGTC ATGAGGCTGT TCCCTTGCTT CCTGCAGCTC 360 

CTGGCCGGGC TGGCGCTGCC TGCTGTGCCC CCCCAGCAGT GGGCCTTGTC TGCTGGGAAC 420 

GGCTCGTCAG AGGTGGAAGT GGTACCCTTC CAGGAAGTGT GGGGCCGCAG CTACTGCCGG 480 



233 
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GCGCTGGAGA GGCTGGTGGA CGTCGTGTCC GAGTACCCCA GCGAGGTGGA GCACATGTTC 540 

AGCCCATCCT GTGTCTCCCT GCTGCGCTGC ACCGGCTGCT GCGGCGATGA GAATCTGCAC 600 

TGTGTGCCGG TGGAGACGGC CAATGTCACC ATGCAGCTCC TAAAGATCCG TTCTGGGGAC 660 

CGGCCCTCCT ACGTGGAGCT GACGTTCTCT CAGCACGTTC GCTGCGAATG CCGGCCTCTG 720 

5 CGGGAGAAGA TGAAGCCGGA AAGGTGCGGC GATGCTGTTC CCCGGAGGTA^CCCACCCCT 760 

TGGAGGAGAG AGACCCCGCA CCCGGCTCGT GTATTTATTA CCGTCACACT CTTCAGTGAC 840 

TCCTGCTGGT ACCTGCCCTC TATTTATTAG CCAACTGTTT CCCTGCTGAA TGCCTCGCTC 900 

CCTTCAAGAC GAGGGGCAGG GAAGGACAGG ACCCTCAGGA ATTCAGTGCC TTCAACAACG 960 

TGAGAGAAAG AGAGAAGCCA GCCACAGACC CCTGGGAGCT TCCGCTTTGA AAGAAGCAAG 1020 

10 ACACGTGGCC TCGTGAGGGG CAAGCTAGGC CCCAGAGGCC CTGGAGGTCT CCAGGGGCCT 1080 

GCAGAAGGAA AGAAGGGGGC CCTGCTACCT GTTCTTGGGC CTCAGGCTCT GCACAGACAA 1140 

GCAGCCCTTG CTTTCGGAGC TCCTGTCCAA AGTAGGGATG CGGATTCTGC TGGGGCCGCC 1200 

ACGGCCTGGT GGTGGGAAGG CCGGCAGCGG GCGGAGGGGA TTCAGCCACT TCCCCCTCTT 1260 

CTTCTGAAGA TCAGAACATT CAGCTCTGGA GAACAGTGGT TGCCTGGGGG CTTTTGCCAC 1320 

15 TCCTTGTCCC CCGTGATCTC CCCTCACACT TTGCCATTTG CTTGTACTGG GACATTGTTC 1380 

TTTCCGGCCG AGGTGCCACC ACCCTGCCCC CACTAAGAGA CACATACAGA GTGGGCCCCG 1440 

GGCTGGAGAA AGAGCTGCCT GGATGAGAAA CAGCTCAGCC AGTGGGGATG AGGTCACCAG 1500 

GGGAGGAGCC TGTGCGTCCC AGCTGAAGGC AGTGGCAGGG GAGCAGGTTC CCCAAGGGCC 1560 

CTGGCACCCC CACAAGCTGT CCCTGCAGGG CCATCTGACT GCCAAGCCAG ATTCTCTTGA 1620 

20 ATAAAGTATT CTAGTGTGGA AACGC , 



25 



Seq ID No: 115 Protein sequence! 
Protein Accession #: NP 002623.1 
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MPVMRLFPCP LQLLAGLALP AVPPQQWALS AGNGSSEVEV VPFQEVWGRS YCRALERLVD 60 
WSEYPSEVB HMFSPSCVSL LRCTGCCGDE NLHCVPVETA NVTMQLLKIR SGDRPSYVEL 120 
TFSQHVRCEC RPLREKMKPE RCGDAVPRR 

Seq ID NO: 116 DNA sequence 

Nucleic Acid Accession #: NM_0 073 61.1 

Coding sequence; 1-4131 (underlined sequences correspond to start and stop codons) 



ATGGAGGGGG 
CTGCAGTTGC 
GAGTCGTGGT 
GCTGGCGAAT 
ACGGCATCAT 
CCACCGACTT 
GCCGAGTCCT 
TGCGCGCTGG 
CTGGGAGCAG 
AACACTTTCC 
CCTGCCAACG 
CAGCTTCCAG 
CCATATTTCA 
CTGGGGATCC 
AGGCCAGCTG 
TTCAGCCATG 
GTGAATGAGG 
AGCAGCATTG 
ACCTTGGATC 
AAAGGCCAAG 
GACAGAGATT 
ATCCAGCCCT 
CCTGAAGAAG 
CGAGGGACGT 
TATAATGCTG 
TTCTGCACGG 
GGGAAGCACT 
CTCCACGTGG 
GGCAATGATG 
CTCCTCCCCC 
GGCTCTGAGA 
TTCTACCCGG 
AACTACCTGA 
ACAGCCCACA 
ACAAGTTCCA 
ATCCACCAGA 
ACCACCCAGC 
CTTAGATTTG 
GTGAATCCTT 
ACAGGTGTAG 



11 
I 

ACCGGGTGGC 
TAATGTTGCG 
GGGACCAGCT 
CCCCTGCACT 
CTCCACTCAG 
CCCGGCCATC 
GTACCGAGAG 
CTTCCCGCGC 
GTAGGCGCTT 
AGGCAGTTTT 
GCCTGCAGTT 
CTCGGGTGGG 
GCTTGACTAG 
CTGGAGTGTG 
CAGTTGGAGA 
CTACAGCCCT 
AGGAAGCTGA 
ATGTTTCCTT 
CTCACACCAA 
TTGAGCCCTG 
CACTGGCTCC 
ACCCAGATGG 
AAATTGTTCT 
ATGAGGTGGG 
CCAACAAGGA 
ACTATGCCAC 
GTCTGCCTGA 
GCCATACACC 
GCAGAGCCTA 
TCACACCAAT 
ACGGCTTCAG 
GAGAGGAGAC 
GCATTAAGAC 
TCTCTCCCTA 
GAGACTACTC 
ACATCACTTA 
AGCTGAACGT 
CTGTGACCAA 
GCTATGATGG 
ATTACACCTG 



21 
I 

CGGGCGGCCG 
GGCCGCGGCG 
CCTGCAGGAA 
TCTTACGAAG 
GACTTCCCCA 
GCCCCTTTTC 
GACACCTCCC 
TCTGCGCGCT 
ACGAGGAGGT 
GGCATCTGAT 
CCTTGGAACC 
CTTCTGCCGA 
CACTGAACAG 
GGCTTTCCAT 
CCTTTCCGCT 
GGAAAGTGAC 
ATACCTTCCG 
CCAATCCAAA 
AGAAGGAACA 
GGATGAGAGA 
TTCCTGGGAA 
AGGGCCAGTG 
TCGAAGTTAC 
ACTGGAAGAC 
AACCTGTGAA 
TGGCTTCTGC 
GGGGGCACCT 
CGTGCACTTC 
CACGGCCATC 
TGGAGGCCTG 
CCTCGCAGGT 
GGTTCGTATC 
CAACATTCAA 
CAAGGAGCTG 
TCTGACTTTT 
CCAGGTGTGC 
GGACCGGGTC 
TCAAATTGGC 
GAGCCACATG 
TGAGTGCGCA 



31 
I 

GTGCTGTCGT 
CTGCACCCAG 
GGCGACGACG 
CCCGATTCAG 
GGGAAACGCA 
TGGCGGACAT 
CCGCAGTGCT 
TTTTACCCCC 
CAAACGCGGG 
GGGTCTGATA 
CGCCGCAAAG 
GGGGAGGCTG 
TCTGTGAAAA 
ATCGGCAGCA 
GCCCACTCTT 
TATAATGAGG 
GGTGAACCAG 
GTGGATACAA 
TCTCTGGGAG 
GAGACCAGAA 
ACCCCACCAC 
CCTTCGGAAA 
CCTGCTTCAG 
AACATAGGTT 
CACAACCACA 
TGCCACTGCC 
CACCGAGTGA 
ACTGATGTGG 
AGCCACATCC 
TTTGGCTGGC 
GCTGCCTTTA 
ACTCAAACTG 
GGCCAGGTGC 
TACCACTACT 
GGTGCAATCA 
AGGCACGCCC 
TTTGCCTTGT 
CCGGTCAAAG 
TGTGACACAA 
TCTGGGTACC 



41 
I 

CGTTACCAGT 
ACGAGCTCTT 
TAAAGCTCAG 
CAACCTCTAC 
GTATGTGGAC 
CGACACGAGC 
GGGCC TGGCC 
ACCCACGCCT 
CGCTGCCCTC 
GCTACGCCCT 
AGTCTTACAA 
ATGATCTGAA 
ATCTCTATCA 
CTTCCCCGTT 
CTGTTCCCCT 
ACAATTTGGA 
AGGAGGCATT 
AGCCTTTAGA 
AGGTAGGGGG 
GCCCAGCTCC 
CGTACCCCGA 
TGGATGTTCC 
GTCACACTAC 
CCAACACCGA 
GACAATGCTC 
AATCCAAGTT 
ATGGGAAAGT 
ACCTGCATGC 
CACAGCCAGC 
TCTTTGCTTT 
CCCATGACAT 
CTGAGGGACT 
CTTACGTCCC 
CCGACTCCAC 
ACCAAACATG 
CCAGACACCC 
ATAATGATGA 
AAGATTCAGA 
CAGCACGGTG 
AGGGAGATGG 



51 
I 

GCTACTGCTG 
CCCACACGGG 
CCGTGGTGAA 
GTGGGCACCA 
TATGATTTCC 
CACGGCAGAG 
GCCCGCTATG 
TCCTGGCCAC 
GGGAGAGCTG 
CTTTCTTTAT 
TGTCCAGCTT 
GTCAGAAGGA 
ACTAAGCAAC 
GGACAATGTC 
GGGACGTTCC 
TTACTACGAT 
GAATGGCCAC 
GGAATCTTCC 
CCCAGATTTA 
ACCAGAGGTA 
AAACGGAAGC 
CCCAGCTCAT 
ACCCTTAAGT 
'GGTCTTCACG 
CCGGCATGCC 
TTATGGAAAT 
GAGTGGCCAC 
GTATATCGTG 
AGCCCAGGCC 
AGAAAAACCT 
GGAAGTTACA 
TGACCCAGAG 
AGCAAATTTC 
TGTGACCTCT 
GTCCTACCGC 
GTCCTTCCCC 
AGAAAGAGTG 
CCCCACTCCG 
CCATCCAGGG 
ACGGAACTGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
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GTGGATGAAA ATGAATGTGC AACTGGCTTT CATCGCTGTG GCCCCAACTC TGTATGTATC 2460 

AACTTGCCTG GAAGCTACAG GTGTGAGTGC CGGAGTGGTT ATGAGTTTGC AGATGACCGG 2520 

CATACTTGCA TCTTGATCAC CCCACCTGCC AACCCCTGTG AGGATGGCAG TCATACCTGT 2580 

GCTCCTGCTG GGCAGGCCCG GTGTGTTCAC CATGGAGGCA GCACGTTCAG CTGTGCCTGC 2640 

CTGCCTGGTT ATGCCGGCGA TGGGCACCAG TGCACTGATG TAGATGAATG CTCAGAAAAC 2700 

AGATGTCACC CTGCAGCTAC CTGCTACAAT ACTCCTGGTT CCTTCTCCTG CCGTTGTCAA 2760 

CCCGGATATT ATGGGGATGG ATTTCAGTGC ATACCTGACT CCACCTCAAG CCTGACACCC 2820 

TGTGAACAAC AGCAGCGCCA TGCCCAGGCC CAGTATGCCT ACCCTGGGGC CCGGTTCCAC 2880 

ATCCCCCAAT GCGACGAGCA GGGCAACTTC CTGCCCCTAC AGTGTCATGG CAGCACTGGT 2940 

TTCTGCTGGT GCGTGGACCC TGATGGTCAT GAAGTTCCTG GTACCCAGAC TCCACCTGGC 3000 

TCCACCCCGC CTCACTGTGG ACCATCACCA GAGCCCACCC AGAGGCCCCC GACCATCTGT 3060 

GAGCGCTGGA GGGAAAACCT GCTGGAGCAC TACGGTGGCA CCCCCCGAGA TGACCAGTAC 3120 

GTGCCCCAGT GCGATGACCT GGGCCACTTC ATCCCCCTGC AGTGCCACGG AAAGAGCGAC 3180 

TTCTGCTGGT GTGTGGACAA AGATGGCAGA GAGGTGCAGG GCACCCGCTC CCAGCCAGGC 3240 

ACCACCCCTG CGTGTATACC CACCGTCGCT CCACCCATGG TCCGGCCCAC GCCCCGGCCA 3300 

GATGTGACCC CTCCATCTGT GGGCACCTTC CTGCTCTATA CTCAGGGCCA GCAGATTGGC 3360 

TACTTACCCC TCAATGGCAC CAGGCTTCAG AAGGATGCAG CTAAGACCCT GCTGTCTCTG 3420 

CATGGCTCCA TAATCGTGGG AATTGATTAC GACTGCCGGG AGAGGATGGT GTACTGGACA 3480 

GATGTTGCTG GACGGACAAT CAGCCGTGCC GGTCTGGAAC TGGGAGCAGA GCCTGAGACG 3540 

ATCGTGAATT CAGGTCTGAT AAGCCCTGAA GGACTTGCCA TAGACCACAT CCGCAGAACA 3600 

ATGTACTGGA CGGACAGTGT CCTGGATAAG ATAGAGAGCG CCCTGCTGGA TGGCTCTGAG 3660 

CGCAAGGTCC TCTTCTACAC AGATCTGGTG AATCCCCGTG CCATCGCTGT GGATCCAATC 3720 

CGAGGCAACT TGTACTGGAC AGACTGGAAT AGAGAAGCTC CTAAAATTGA AACGTCATCT 3780 

TTAGATGGAG AAAACAGAAG AATTCTGATC AATACAGACA TTGGATTGCC CAATGGCTTA 384 0 

ACCTTTGACC CTTTCTCTAA ACTGCTCTGC TGGGCAGATG CAGGAACCAA AAAACTGGAG 3900 

TGTACACTAC CTGATGGAAC TGGACGGCGT GTCATTCAAA ACAACCTCAA GTACCCCTTC 3960 

AGCATCGTAA GCTATGCAGA TCACTTCTAC CACACAGACT GGAGGAGGGA TGGTGTTGTA 4020 

TCAGTAAATA AACATAGTGG CCAGTTTACT GATGAGTATC TCCCAGAACA ACGATCTCAC 4 0 80 

CTCTACGGGA TAACTGCAGT CTACCCCTAC TGCCCAACAG GAAGAAA GTA A GTACAGTAA 4140 

TGTAAAGGAA GACTTGGAGT TTACAATCAG AACCTGGACC CTAAAGAACA GTGACTGCAA 4200 

AGGCAAAGAA AGTAAAAAAG GAATTGGCCA TTAGACGTTC CTGAGCATCC AAGATGAACA 4260 

TTTTGTAGTG CAAAAAGACT TTTGTGAAAA GCTGATACCT CAATCTTTAC TACTGTATTT 4320 

TTAAAAATGA AGGTTGTTAT TGCAAGTTTA AAAAGGTAAC AGAATTTTAA CTGTTGCTTA 4380 

TTAAAGCAAC TTCTTGTAAA CATTTATCAT TAATATTTAA AAGATCAAAT TCATTCAACT 4440 

AAGAATTAGA GTTTAAGACT CTAAACCTGA TTTTTGCCAT GGATTCCTTC TGGCCAAGAA 4500 

ATTAAAGCAC ATGTGATCAA TATAACAATA TAATCCTAAA CCTTGACAGT TGGAGAAGCC 4560 

AATGCAGAAC TGATGGGAAA GGACCAATTA TTTATAGTTT CCCAACAAAA GTTCTAAGAT 4620 

TTTTTACCTC TGCATCAGTG CATTTCTATT TATATCAAAA GGTGCTAAAA TGATTCAATT 4680 

TGCATTTTCT GATCCTGTAG TGCCTCTATA GAAGTACCCA CAGAAAGTAA AGTATCACAT 4740 

TTATAAATAC CAAAGATGTA ACAATTTTAA AATTTTCTAG ATTACTCCAA TAAAGTGTTT 4800 
TAAGTTTAAA AAAAAAAAAA AAAAAAAAA 



Seq ID Mo: 117 Protein sequence: 
protein Accession #: NP_03l387.l 

1 11 21 31 41 51 

I I I . • I I I . 

MEGDRVAGRP VLSSLPVLLL LQLLMLRAAA LHPDELFPHG BSWWDQLLQE GDDVKLSRGE 60 • 

AGESPALLTK PDSATSTWAP TASSPLRTSP GKRSMWTMIS PPTSRPSPLF WRTSTRATAE 120 

AESCTERTPP -PQCWAWPPAM CALASRALRA FYPHPRLPGH LGAGRRLRGG QTRALPSGEL 180 

NTFQAVLASD GSDSYALFLY PANGLQFLGT RPKESYNVQL QLPARVGFCR GEADDLKSEG 240 

PYFSLTSTEQ SVKNLYQLSN LGIPGVWAFH IGSTSPLDNV RPAAVGDIiSA AHSSVPLGRS 300 

FSHATALESD YNEDNLDYYD VNEEEAEYLP GEPEEALNGH SSIDVSFQSK VDTKPLEESS 360 

TLDPHTKEGT SLGEVGGPDL KGQVEPWDER ETRSPAPPEV DRDSLAPSWE TPPPYPENGS 420 

IQPYPDGGPV PSEMDVPPAH PEEEIVLRSY PASGHTTPLS RGTYEVGLED NIGSNTEVFT 4 80 

YNAANKETCE HNHRQCSRHA FCTDYATGFC CHCQSKFYGN GKHCLPEGAP HRVNGKVSGH 540 

LHVGHTPVHF TDVDLHAYIV GNDGRAYTAI SHIPQPAAQA LLPLTPIGGL FGWLFALEKP 600 

GSENGFSLAG AAFTHDMBVT FYPGEETVRI TQTAEGLDPE NYLSIKTNIQ GQVPYVPANF 660 

TAHISPYKEL YHYSDSTVTS TSSRDYSLTF GAINQTWSYR IHQNITYQVC RHAPRHPSFP 720 

TTQQLNVDRV FALYNDEERV LRFAVTNQIG PVKEDSDPTP VNPCYDGSHM CDTTARCHPG 780 

TGVDYTCECA SGYQGDGRNC VDENECATGF HRCGPNSVCI NLPGSYRCEC RSGYEFADDR 840 

HTCILITPPA NPCEDGSHTC APAGQARCVH HGGSTFSCAC LPGYAGDGHQ CTDVDECSEN 900 

RCHPAATCYN TPGSFSCRCQ PGYYGDGFQC IPDSTSSLTP CEQQQRHAQA QYAYPGARFH 960 

IPQCDEQGNF LPLQCHGSTG FCWCVDPDGH EVPGTQTPPG STPPHCGPSP EPTQRPPTIC 1020 

ERWRENLLEH YGGTPRDDQY VPQCDDLGHF IPLQCHGKSD FCWCVDKDGR EVQGTRSQPG 1080 

TTPACIPTVA PPMVRPTPRP DVTPPSVGTF LLYTQGQQIG YLPLNGTRLQ KDAAKTLLSL 1140 

HGSIIVGIDY DCRERMVYWT DVAGRTISRA GLELGAEPET IVNSGLISPE GLAIDHIRRT 1200 

MYWTDSVLDK IBSALLDGSE RKVLFYTDLV NPRAIAVDPI RGNLYWTDWN REAPKIETSS 1260 

LDGENRRIJjI NTDIGLPNGL TFDPFSKLLC WADAGTKKLE CTLPDGTGRR VIQNNLKYPF 1320 
SIVSYADHFY HTDWRRDGW SVNKHSGQFT DEYLPEQRSH LYGITAVYPY CPTGRK 

Seq ID NO: 118 DNA sequence 

Nucleic Acid Accession #: NMJ)03 088.1 

Coding sequence: 112-1593 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

I I I I I I 

GCGGAGGGTG CGTGCGGGCC GCGGCAGCCG AACAAAGGAG CAGGGGOGCC GCCGCAGGGA 60 

CCCGCCACCC ACCTCCCGGG GCCGCGCAGC GGCCTCTCGT CTACTGCCAC CATGACCGCC 120 

AACGGCACAG CCGAGGCGGT GCAGATCCAG TTCGGCCTCA TCAACTGCGG CAACAAGTAC 180 

CTGACGGCCG AGGCGTTCGG GTTCAAGGTG AACGCGTCCG CCAGCAGCCT GAAGAAGAAG 240 

CAGATCTGGA CGCTGGAGCA GCCCCCTGAC GAGGCGGGCA GCGCGGCCGT GTGCCTGCGC 300 

AGCCACCTGG GCCGCTACCT GGCGGCGGAC AAGGACGGCA ACGTGACCTG CGAGCGCGAG 360 

GTGCCCGGTC CCGACTGCCG TTTCCTCATC GTGGCGCACG ACGACGGTCG CTGGTCGCTG 420 

CAGTCCGAGG CGCACCGGCG CTACTTCGGC GGCACCGAGG ACCGCCTGTC CTGCTTCGCG 480 

CAGACGGTGT CCCCCGCCGA GAAGTGGAGC GTGCACATCG CCATGCACCC TCAGGTCAAC 540 

ATCTACAGTG TCACCCGTAA GCGCTACGCG CACCTGAGCG CGCGGCCGGC CGACGAGATC 600 

GCCGTGGACC GCGACGTGCC CTGGGGCGTC GACTCGCTCA TCACCCTCGC CTTCCAGGAC 660 

CAGCGCTACA GCGTGCAGAC CGCCGACCAC CGCTTCCTGC GCCACGACGG GCGCCTGGTG 720 

GCGCGCCCCG AGCCGGCCAC TGGCTACACG CTGGAGTTCC GCTCCGGCAA GGTGGCCTTC 780 

CGCGACTGCG AGGGCCGTTA CCTGGCGCCG TCGGGGCCCA GCGGCACGCT CAAGGCGGGC 840 

AAGGCCACCA AGGTGGGCAA GGACGAGCTC TTTGCTCTGG AGCAGAGCTG CGCCCAGGTC 900 

GTGCTGCAGG CGGCCAACGA GAGGAACGTG TCCACGCGCC AGGGTATGGA CCTGTCTGCC 960 

AATCAGGACG AGGAGACCGA CCAGGAGACC TTCCAGCTGG AGATCGACCG CGACACCAAA 1020 

AAGTGTGCCT TCCGTACCCA CACGGGCAAG TACTGGACGC TGACGGCCAC CGGGGGCGTG 1080 

CAGTCCACCG CCTCCAGCAA GAATGCCAGC TGCTACTTTG ACATCGAGTG GCGTGACCGG 1140 

CGCATCACAC TGAGGGCGTC CAATGGCAAG TTTGTGACCT CCAAGAAGAA TGGGCAGCTG 1200 

GCCGCCTCGG TGGAGACAGC AGGGGACTCA GAGCTCTTCC TCATGAAGCT CATCAACCGC 1260 

CCCATCATCG TGTTCCGCGG GGAGCATGGC TTCATCGGCT GCCGCAAGGT CACGGGCACC 1320 

CTGGACGCCA ACCGCTCCAG CTATGACGTC TTCCAGCTGG AGTTCAACGA TGGCGCCTAC 1380 

AACATCAAAG ACTCCACAGG CAAATACTGG ACGGTGGGCA GTGACTCCGC GGTCACCAGC 1440 

AGCGGCGACA CTCCTGTGGA CTTCTTCTTC GAGTTCTGCG ACTATAACAA GGTGGCCATC 1500 

AAGGTGGGCG GGCGCTACCT GAAGGGCGAC CACGCAGGCG TCCTGAAGGC CTCGGCGGAA 1560 

ACCGTGGACC CCGCCTCGCT CTGGGAGTAC TAGGGCCGGC CCGTCCTTCC CCGCCCCTGC 1620 

CCACATGGCG GCTCCTGCCA ACCCTCCCTG CTAACCCCTT CTCCGCCAGG TGGGCTCCAG 1680 

GGCGGGAGGC AAGCCCCCTT GCCTTTCAAA CTGGAAACCC CAGAGAAAAC GGTGCCCCCA 1740 

CCTGTCGCCC CTATGGACTC CCCACTCTCC CCTCCGCCCG GGTTCCCTAC TCCCCTCGGG 1800 

TCAGCGGCTG CGGCCTGGCC CTGGGAGGGA TTTCAGATGC CCCTGCCCTC TTGTCTGCCA 1860 

CGGGGCGAGT CTGGCACCTC TTTCTTCTGA CCTCAGACGG CTCTGAGCCT TATTTCTCTG 1920 

GAAGCGGCTA AGGGACGGTT GGGGGCTGGG AGCCCTGGGC GTGTAGTGTA ACTGGAATCT 1980 

TTTGCCTCTC CCAGCCACCT CCTCCCAGCC CCCCAGGAGA GCTGGGCACA TGTCCCAAGC 2040 

CTGTCAGTGG CCCTCCCTGG TGCACTGTCC CCGAAACCCC TGCTTGGGAA GGGAAGCTGT 2100 

CGGGAGGGCT AGGACTGACC CTTGTGGTGT TTTTTTGGGT GGTGGCTGGA AACAGCCCCT 2160 

CTCCCACGTG GGAGAGGCTC AGCCTGGCTC CCTTCCCTGG AGCGGCAGGG CGTGACGGCC 2220 

ACAGGGTCTG CCCGCTGCAC GTTCTGCCAA GGTGGTGGTG GCGGGCGGGT AGGGGTGTGG 2280 

GGGCCGTCTT CCTCCTGTCT CTTTCCTTTC ACCCTAGCCT GACTGGAAGC AGAAAATGAC 2340 

CAAATCAGTA TTTTTTTTAA TGAAATATTA TTGCTGGAGG CGTCCCAGGC AAGCCTGGCT 2400 

GTAGTAGCGA GTGATCTGGC GGGGGGCGTC TCAGCACCCT CCCCAGGGGG TGCATCTCAG 2460 

CCCCCTCTTT CCGTCCTTCC CGTCCAGCCC CAGCCCTGGG CCTGGGCTGC CGACACCTGG 2520 

GCCAGAGCCC CTGCTGTGAT TGGTGCTCCC TGGGCCTCCC GGGTGGATGA AGCCAGGCGT 2580 

CGCCCCCTCC GGGAGCCCTG GGGTGAGCCG CCGGGGCCCC CCTGCTGCCA GCCTCCCCCG 2640 

TCCCCAACAT GCATCTCACT CTGGGTGTCT TGGTCTTTTA TTTTTTGTAA GTGTCATTTG 2700 

TATAACTCTA AACGCCCATG ATAGTAGCTT CAAACTGGAA ATAGCGAAAT AAAATAACTC 2760 
AGTCTGC 



Seq ID No: 119 Protein sequence: 
Protein Accession #: NP_003079.1 

1 11 21 31 41 51 

I I I I I I 

MTANGTAEAV QIQFGLINCG NKYLTAEAFG FKVNASASSL KKKQIWTLEQ PPDEAGSAAV 60 
CLRSHLGRYL AADKDGNVTC EREVPGPDCR FLIVAHDDGR WSLQSEAHRR YFGGTEDRLS 120 
CFAQTVSPAE KWSVHIAMHP QVNIYSVTRK RYAHLSARPA DEIAVDRDVP WGVDSLITLA 180 
FQDQRYSVQT ADHRFLRHDG RLVARPEPAT GYTLEFRSGK VAFRDCEGRY LAPSGPSGTL 240 
KAGKATKVGK DELFALEQSC AQWLQAANE RNVSTRQGMD LSANQDEETD QETFQLEIDR 300 
DTKKCAFRTH TGKYWTLTAT GGVQSTASSK NASCYFDIEW RDRRITLRAS NGKFVTSKKN 360 
GQIiAASVETA GDSELFLMKL INRPIIVFRG EHGFIGCRKV TGTIDANRSS YDVFQLEFND 420 
GAYNIKDSTG KYWTVGSDSA VTSSGDTPVD FFFEFCDYNK VAIKVGGRYL KGDHAGVLKA 480 . 
SAETVDPASL WEY 

Seq ID NO: 120 DNA sequence 

Nucleic Acid Accession #: NM_006404.1 

Coding sequence: 25-741 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CAGGTCCGGA GCCTCAACTT CAGGATGTTG ACAACATTGC TGCCGATACT GCTGCTGTCT 60 

GGCTGGGCCT TTTGTAGCCA AGACGCCTCA GATGGCCTCC AAAGACTTCA TATGCTCCAG 120 

ATCTCCTACT TCCGCGACCC CTATCACGTG TGGTACCAGG GCAACGCGTC GCTGGGGGGA 180 
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CACCTAACGC ACGTGCTGGA AGGCCCAGAC ACCAACACCA CGATCATTCA GCTGCAGCCC 240 

TTGCAGGAGC CCGAGAGCTG GGCGCGCACG CAGAGTGGCC TGCAGTCCTA CCTGCTCCAG 300 

TTCCACGGCC TCGTGCGCCT GGTGCACCAG GAGCGGACCT TGGCCTTTCC TCTGACCATC 360 

CGCTGCTTCC TGGGCTGTGA GCTGCCTCCC GAGGGCTCTA GAGCCCATGT CTTCTTCGAA 420 

GTGGCTGTGA ATGGGAGCTC CTTTGTGAGT TTCCGGCCGG AGAGAGCCTT GTGGCAGGCA 480 

GACACCCAGG TCACCTCCGG AGTGGTCACC TTCACCCTGC AGCAGCTCAA TGCCTACAAC 540 

CGCACTCGGT ATGAACTGCG GGAATTCCTG GAGGACACCT GTGTGCAGTA TGTGCAGAAA 600 

CATATTTCCG CGGAAAACAC GAAAGGGAGC CAAACAAGCC GCTCCTACAC TTCGCTGGTC 660 

CTGGGCGTCC TGGTGGGCGG TTTCATCATT GCTGGTGTGG CTGTAGGCAT CTTCCTGTGC 720 

ACAGGTGGAC GGCGATG TTA A TTACTCTCC AGCCCCGTCA GAAGGGGCTG GATTGATGGA 780 

GGCTGGCAAG GGAAAGTTTC AGCTCACTGT GAAGCCAGAC TCCCCAACTG AAACACCAGA 840 

AGGTTTGGAG TGACAGCTCC TTTCTTCTCC CACATCTGCC CACTGAAGAT TTGAGGGAGG 900 

GGAGATGGAG AGGAGAGGTG GACAAAGTAC TTGGTTTGCT AAGAACCTAA GAACGTGTAT 960 

GCTTTGCTGA ATTAGTCTGA TAAGTGAATG TTTATCTATC TTTGTGGAAA ACAGATAATG 1020 

GAGTTGGGGC AGGAAGCCTA TGCGCCATCC TCCAAAGACA GACAGAATCA CCTGAGGCGT 1080 

TCAAAAGATA TAACCAAATA AACAAGTCAT CCACAATCAA AATACAACAT TCAATACTTC 1140 

CAGGTGTGTC AGACTTGGGA TGGGACGCTG ATATAATAGG GTAGAAAGAA GTAACACGAA 1200 

GAAGTGGTGG AAATGTAAAA TCCAAGTCAT ATGGCAGTGA TCAATTATTA ATCAATTAAT 1260 
AATATTAATA AATTTCTTAT ATTT 



Seq ID No: 121 Protein sequence: 
Protein Accession #: NP_006395.1 

1 11 21 31 41 51 

III I I I 

MLTTLLPILIi lsgwafcsqd asdglqrlhm lqisyfrdpy hvwyqgnasl GGHLTHVLEG 60 

PDTNTTIIQL QPLQEPESWA RTQSGLQSYL LQFHGLVRLV HQERTLAFPL TIRCFLGCEL 120 
PPEGSRAHVF FEVAVNGSSF VSFRPERALW QADTQVTSGV VTFTLQQLNA YNRTRYELRE 180 
FLEDTCVQYV QKHISAENTK GSQTSRSYTS LVLGVLVGGF IIAGVAVGIF LCTGGRRC 

Seq ID NO: 122 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 2-505 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

III III 

C GAGA AGCTG GGAGAGACAC CACTTGTCCC TGAACAAGAC AATTCAGTAA CATCTATTCC 60 

TGAGATTCCT CGATGGGGAT CACAGAGCAC GATGTCTACC CTTCAAATGT CCCTTCAAGC 120 

CGAGTCAAAG GCCACTATCA CCCCATCAGG GAGCGTGATT TCCAAGTTTA ATTCTACGAC 180 

TTCCTCTGCC ACTCCTCAGG CTTTCGACTC CTCCTCTGCC GTGGTCTTCA TATTTGTGAG 240 

CACAGCAGTA GTAGTGTTGG TGATCTTGAC CATGACAGTA CTGGGGCTTG TCAAGCTCTG 300 

CTTTCACGAA AGCCCCTCTT CCCAGCCAAG GAAGGAGTCT ATGGGCCCGC CGGGCCTGGA 360 

GAGTGATCCT GAGCCCGCTG CTTTGGGCTC CAGTTCTGCA CATTGCACAA ACAATGGGGT 420 

GAAAGTCGGG GACTGTGATC TGCGGGACAG AGCAGAGGGT GCCTTGCTGG CGGAGTCCCC 480 

TCTTGGCTCT AGTGATGC AT AGG GAAACAG GGGACATGGG CACTCCTGTG AACAGTTTTT 540 

CACTTTTGAT GAAACGGGGA ACCAAGAGGA ACTTACTTGT GTAACTGACA ATTTCTGCAG 600 

AAATCCCCCT TCCTCTAAAT TCCCTTTACT CCACTGAGGA GCTAAATCAG AACTGCACAC 660 

TCCTTCCCTG ATGATAGAGG AAGTGGAAGT GCCTTTAGGA TGGTGATACT GGGGGACCGG 720 

GTAGTGCTGG GGAGAGATAT TTTCTTATGT TTATTCGGAG AATTTGGAGA AGTGATTGAA 780 

CTTTTCAAGA CATTGGAAAC AAATAGAACA CAATATAATT TACATTAAAA AATAATTTCT 840 

ACCAAAATGG AAAGGAAATG TTCTATGTTG TTCAGGCTAG GAGTATATTG GTTCGAAATC 900 
CCAGGGAAAA AAATAAAAAT AAAAAATTAA AGGATTGTTG ATAAAA 

Seq ID No: 123 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

EKLGETPLVP EQDNSVTSIP EIPRWGSQST MSTLQMSLOA ESKATITPSG SVISKFNSTT 60 
SSATPQAFDS SSAWFIFVS TAWVLVILT MTVLGLVKLC FHESPSSQPR KESMGPPGLE 120 
SDPEPAALGS SSAHCTNNGV KVGDCDLRDR AEGALLAESP LGSSDA 



Seq ID NO: 124 DNA sequence 

Nucleic Acid Accession #: NM_006500.1 

Coding sequence: 27-1967 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 

TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 
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GCATCTTCTT GTGCCAGGGC AAGCGCCCTC 
TCTACAAAGC TCCGGAGGAG CCAAACATCC 
GTAAGGAGCC TGAGGAGGTC GCTACCTGTG 
TCATCTGGTA CAAGAATGGC CGGCCTCTGA 
CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT 
TGGTTAAAGA AGACAAAGAT GCCCAGTTTT 
GGAACCACAT GAAGGAGTCC AGGGAAGTCA 
TGTGGCTGGA AGTGGAGCCC GTGGGAATGC 
GTTTGGCTGA TGGCAACCCT CCACCACACT 
GGGAGGCAGA GGAAGAGACA ACCAACGACA 
AGGAACACAG TGGGCGCTAT GAATGTCAGG 
TGAGTGAACC ACAGGAACTA CTGGTGAACT 
CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA 
ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA 
TTCAGTTGCA TGACCTGAAA CGGGAGGCAG 
CCAGCATACC CGGCCTGAAC CGCACACAGC 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG 
GTGAAGCGTC AGGGCACCCC CGGCCCACCA 
AACAAGACCA AGATCCACAG CGAGTCCTGA 
TGTTGGAGAC AGGTGTTGAA TGCACGGCCT 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC 
TCAGCACTTC CACTGCCAGT CCTCATACCA 
TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA 
TGGCGGTGCT GGGCGCTGTC CTCTATTTCC 
GCTCAGGGAA GCAGGAGATC ACGCTGCCCC 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG 
GGGCTCCGGG AGACCAGGGA GAGAAATACA 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA 
CCTCCAAAGG GACTAGAGAG AAGCCTCCTG 
GGCCACTGGG TTAGGACCTG AGGACCTCAC 
GTCCACCACC ATCTCCTCCA CGTTGAGTGA 
CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT 
CAAAGGCTGG CTTCCACCAT CCAGGTGCAC 
GCCTGCTCAT GTTGAAGTGC GCTGTTCACA 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC 
TCACAAAGTC AGGACGAGAC CATCCTGGCT 
TACAAAAAAA AATTAGCTAG GCGTAGTGGT 
CTGAAGCAGG AGAATGGTAT GAATCCAGGA 
CACTGCACTC CAGCCTGGGC AACACAGCGA 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC 
TCCCCGTGTT CACTTGCTCC CATAGCCCTC 
GGGGAGCAGA CAAAGATGAG GTCTACACTG 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA 
AGAATGGTAC TTAGGGATGG AAAACGGGGC 
CTGTGTGTAT GCATACATAT GTGTGTATAT 
TTGTTTCCTT ' TATATATGTA TGTATATATA 
AAAGCTTAAT TGTCCCAGAA AATCATACAT 
AACCTGGGGG CCTGTGAAAC TACAACCAAA 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC 
CTACCCTACT TTTCAGCAGC AAAACGTCCC 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG 



GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 

TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 

AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 

TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TATATGAAAA TATATATATA TATGAAAAAT 3300 

TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TTTCCGTCCA CTT 



Seq ID No: 125 Protein sequence:. 
Protein Accession #: NP__006491.1 



1 11 21 31 41 51 

I I I I J I 

MGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNLSHV 60 

DWFSVHKEKR TLIFRVRQGQ GQSEPGEYEQ RLSLQDRGAT LALTQVTPQD ERIFLCQGKR 120 

PRSQEYR1QL RVYKAPEEPN IQVNPLGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

LKEEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVFYPTE KVWLEVEPVG MLKEGDRVEI RCLADGNPPP HFSISKQNPS TREAEEETTN 300 

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS 360 

LTLTCEAESS QDLEFQWLRE ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGLNRT 420 

QLVKLAIFGP PWMAFKERKV WVKENMVLNL SCEASGHPRP TISWNVNGTA SEQDQDPQRV 480 

LSTLNVLVTP ELLETGVECT ASNDLGKNTS ILFLELVNLT TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTER KLPEPESRGV VIVAVIVCIL VLAVLGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTEIiW EVKSDKLPBE MGLLQGSSGD KRAPGDQGEK YIDLRH 



Seq ID NO: 126 DNA sequence 

Nucleic Acid Accession #: NMJJ01955.1 
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Coding sequence: 337-975 (underlined sequences correspond to start and stop'codons) 



1 11 21 31 41 51 

I I I I I I 

GGAGCTGTTT ACCCCCACTC TAATAGGGGT TCAATATAAA AAGCCGGCAG AGAGCTGTCC 60 

AAGTCAGACG CGCCTCTGCA TCTGCGCCAG GCGAACGGGT CCTGCGCCTC CTGCAGTCCC 120 

AGCTCTCCAC CACCGCCGCG TGCGCCTGCA GACGCTCCGC TCGCTGCCTT CTCTCCTGGC 180 

AGGCGCTGCC TTTTCTCCCC GTTAAAGGGC ACTTGGGCTG AAGGATCGCT TTGAGATCTG 240 

AGGAACCCGC AGCGCTTTGA GGGACCTGAA GCTGTTTTTC TTCGTTTTCC TTTGGGTTCA 300 

GTTTGAACGG GAGGTTTTTG ATCCCTTTTT TTCAG AATGG ATTATTTGCT CATGATTTTC 360. 

TCTCTGCTGT TTGTGGCTTG CCAAGGAGCT CCAGAAACAG CAGTCTTAGG CGCTGAGCTC 420 

AGCGCGGTGG GTGAGAACGG CGGGGAGAAA CCCACTCCCA GTCCACCCTG GCX3GCTCCGC 480 

CGGTCCAAGC GCTGCTCCTG CTCGTCCCTG ATGGATAAAG AGTGTGTCTA CTTCTGCCAC 540 

CTGGACATCA TTTGGGTCAA CACTCCCGAG CACGTTGTTC CGTATGGACT TGGAAGCCCT 600 

AGGTCCAAGA GAGCCTTGGA GAATTTACTT CCCACAAAGG CAACAGACCG TGAGAATAGA 660 

TGCCAATGTG CTAGCCAAAA AGACAAGAAG TGCTGGAATT TTTGCCAAGC AGGAAAAGAA 720 

CTCAGGGCTG AAGACATTAT GGAGAAAGAC TGGAATAATC ATAAGAAAGG AAAAGACTGT 780 

TCCAAGCTTG GGAAAAAGTG TATTTATCAG CAGTTAGTGA GAGGAAGAAA AATCAGAAGA 840 

AGTTCAGAGG AACACCTAAG ACAAACCAGG TCGGAGACCA TGAGAAACAG CGTCAAATCA 900 

TCTTTTCATG ATCCCAAGCT GAAAGGCAAG CCCTCCAGAG AGCGTTATGT GACCCACAAC 960 

CGAGCACATT GGTGACAGAC TTCGGGGCCT GTCTGAAGCC ATAGCCTCCA CGGAGAGCCC 1020 

TGTGGCCGAC TCTGCACTCT CCACCCTGGC TGGGATCAGA GCAGGAGCAT CCTCTGCTGG 1080 

TTCCTGACTG GCAAAGGACC AGCGTCCTCG TTCAAAACAT TCCAAGAAAG GTTAAGGAGT 1140 

TCCCCCAACC ATCTTCACTG GCTTCCATCA GTGGTAACTG CTTTGGTCTC TTCTTTCATC 1200 
TGGGGATGAC AATGGACCTC TCAGCAGAAA CACACAGTCA CATTCGAATT C 

Seq ID No: 127 Protein sequence: 
Protein Accession #: NP 001946.1 
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MDYLLMIFSL LFVACQGAPE TAVLGAELSA VGENGGEKPT PSPPWRLRRS KRCSCSSLMD 60 
KECVYFCHLD IIWVNTPEHV VPYGLGSPRS KRALENLLPT KATDRENRCQ CASQKDKKCW 120 
NFCQAGKEIiR AEDIMEKDWN NHKKGKDCSK LGKKCIYQQL VRGRKIRRSS EEHLRQTRSE 180 
TMRNSVKSSF HDPKLKGKPS RERYVTHNRA HW 



Seq ID NO: 128 DNA sequence 

Nucleic Acid Accession #: NM_001721.1 

Coding sequence: 34-2061 (underlined sequences correspond to start and stop codons) 
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GCAAGCACGG AACAAGCTGA GACGGATGAT AAT ATGG ATA CAAAATCTAT TCTAGAAGAA 60 

CTTCTTCTCA AAAGATCACA GCAAAAGAAG AAAATGTCAC CAAATAATTA CAAAGAACGG 120 

CTTTTTGTTT TGACCAAAAC AAACCTTTCC TACTATGAAT ATGACAAAAT GAAAAGGGGC 180 

AGCAGAAAAG GATCCATTGA AATTAAGAAA ATCAGATGTG TGGAGAAAGT AAATCTCGAG 240 

GAGCAGACGC CTGTAGAGAG ACAGTACCCA TTTCAGATTG TCTATAAAGA TGGGCTTCTC 300 

TATGTCTATG CATCAAATGA AGAGAGCCGA AGTCAGTGGT TGAAAGCATT ACAAAAAGAG 360 

ATAAGGGGTA ACCCCCACCT GCTGGTCAAG TACCATAGTG GGTTCTTCGT GGACGGGAAG 420 

TTCCTGTGTT GCCAGCAGAG CTGTAAAGCA GCCCCAGGAT GTACCCTCTG GGAAGCATAT 480 

GCTAATCTGC ATACTGCAGT CAATGAAGAG AAACACAGAG TTCCCACCTT CCCAGACAGA 540 

GTGCTGAAGA TACCTCGGGC AGTTCCTGTT CTCAAAATGG ATGCACCATC TTCAAGTACC 600 

ACTCTAGCCC AATATGACAA CGAATCAAAG AAAAACTATG GCTCCCAGCC ACCATCTTCA 660 

AGTACCAGTC TAGCGCAATA TGACAGCAAC TCAAAGAAAA TCTATGGCTC CCAGCCAAAC 720 

TTCAACATGC AGTATATTCC AAGGGAAGAC TTCCCTGACT GGTGGCAAGT AAGAAAACTG 780 

AAAAGTAGCA GCAGCAGTGA AGATGTTGCA AGCAGTAACC AAAAAGAAAG AAATGTGAAT 840 

CACACCACCT CAAAGATTTC ATGGGAATTC CCTGAGTCAA GTTCATCTGA AGAAGAGGAA 900 

AACCTGGATG ATTATGACTG GTTTGCTGGT AACATCTCCA GATCACAATC TGAACAGTTA 960 

CTCAGACAAA AGGGAAAAGA AGGAGCATTT ATGGTTAGAA ATTCGAGCCA AGTGGGAATG 1020 

TACACAGTGT CCTTATTTAG TAAGGCTGTG AATGATAAAA AAGGAACTGT CAAACATTAC 1080 

CACGTGCATA CAAATGCTGA GAACAAATTA TACCTGGCAG AAAACTACTG TTTTGATTCC 1140 

ATTCCAAAGC TTATTCATTA TCATCAACAC AATTCAGCAG GCATGATCAC ACGGCTCCGC 1200 

CACCCTGTGT CAACAAAGGC CAACAAGGTC CCCGACTCTG TGTCCCTGGG AAATGGAATC 1260 

TGGGAACTGA AAAGAGAAGA GATTACCTTG TTGAAGGAGC TGGGAAGTGG CCAGTTTGGA 1320 

GTGGTCCAGC TGGGCAAGTG GAAGGGGCAG TATGATGTTG CTGTTAAGAT GATCAAGGAG 1380 

GGCTCCATGT CAGAAGATGA ATTCTTTCAG GAGGCCCAGA CTATGATGAA ACTCAGCCAT 1440 

CCCAAGCTGG TTAAATTCTA TGGAGTGTGT TCAAAGGAAT ACCCCATATA CAT AG TG ACT 1500 

GAATATATAA GCAATGGCTG CTTGCTGAAT TACCTGAGGA GTCACGGAAA AGGACTTGAA 1560 

CCTTCCCAGC TCTTAGAAAT GTGCTACGAT GTCTGTGAAG GCATGGCCTT CTTGGAGAGT 1620 

CACCAATTCA TACACCGGGA CTTGGCTGCT CGTAACTGCT TGGTGGACAG AGATCTCTGT 1680 

GTGAAAGTAT CTGACTTTGG AATGACAAGG TATGTTCTTG ATGACCAGTA TGTCAGTTCA 1740 
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GTCGGAACAA AGTTTCCAGT CAAGTGGTCA GCTCCAGAGG TGTTTCATTA CTTCAAATAC 1800 

AGCAGCAAGT CAGACGTATG GGCATTTGGG ATCCTGATGT GGGAGGTGTT CAGCCTGGGG 1860 

AAGCAGCCCT ATGACTTGTA TGACAACTCC CAGGTGGTTC TGAAGGTCTC CCAGGGCCAC 1920 

AGGCTTTACC GGCCCCACCT GGCATCGGAC ACCATCTACC AGATCATGTA CAGCTGCTGG 1980 

5 CACGAGCTTC CAGAAAAGCG TCCCACATTT CAGCAACTCC TGTCTTCCAT TGAACCACTT 2040 

CGGGAAAAAG ACAAGCA TTG AA GAAGAAAT TAGGAGTGCT GATAAGAATG AATATAGATG 2100 

CTGGCCAGCA TTTTCATTCA TTTTAAGGAA AGTAGGAAGG CATAAGTAAT TTTAGCTAGT 2160 

TTTTAATAGT GTTCTCTGTA TTGTCTATTA TTTAGAAATG AACAAGGCAG GAAACAAAAG 2220 

ATTCCCTTGA AATTTAGATC AAATTAGTAA TTTTGTTTTA TGCTGCTCCT GATATAACAC 2280 

10 TTTCCAGCCT ATAGCAGAAG CACATTTTCA GACTGCAATA TAGAGACTGT GTTCATGTGT 2340 

AAAGACTGAG CAGAACTGAA AAATTACTTA TTGGATATTC ATTCTTTTCT TTATATTGTC 2400 
ATTGTCACAA CAATTAAATA TACTACCAAG TACAGAAATG TGGAAAAAAA AAACCG 

Seg ID No: 129 Protein sequence t 
15 Protein Accession #: NPJJ01712.1 



1 11 21 31 41 51 

20 MDTKSILEEL LLKRSQQKKK MSPNNYKERL FVLTKTNLSY YEYDKMKRGS RKGSIEIKKI 
RCVEKVNLEE QTPVERQYPF QIVYKDGLLY VYASNEESRS QWLKALQKEI RGNPHLLVKY 
HSGFFVDGKF LCCQQSCKAA PGCTLWEAYA NLHTAVNEEK HRVPTFPDRV LKIPRAVPVL 
KMDAPSSSTT LAQYDNESKK NYGSQPPSSS TSLAQYDSNS KKIYGSQPNF NMQYIPREDF 
PDWWQVRKLK SSSSSEDVAS SNQKERNVNH TTSKISWEFP ESSSSEEEEN LDDYDWFAGN 

25 ISRSQSEQLL RQKGKEGAFM VRNSSQVGMY TVSLFSKAVN DKKGTVKHYH VHTNAENKLY 
IAENYCFDSI PKLIHYHQHN SAGMITRLRH PVSTKANKVP DSVSLGNGIW ELKREEITLL 
KELGSGQFGV VQLGKWKGQY D VAVKMI KEG SMSEDEFFQE AQTMMKLSHP KLVKFYGVCS 
KEYPIYIVTE YISNGCLLNY LRSHGKGIjEP SQLLEMCYDV CEGMAFLESH QFIHRDLAAR 
NCLVDRDLCV KVSDFGMTRY VLDDQYVSSV GTKFPVKWSA PEVFHYFKYS SKSDVWAFGI 

30 LMWEVFSLGK QPYDLYDNSQ WLKVSQGHR LYRPHIiASDT IYQIMYSCWH ELPEKRPTFQ 
QLLSSIEPLR EKDKH 



Seq ID NO: 130 DNA sequence 
35 Nucleic Acid Accession #: NM_012072.2 

Coding sequence: 149-2107 (underlined sequences correspond to start and stop codons) 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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AAAGCCCTCA GCCTTTGTGT CCTTCTCTGC GCCGGAGTGG CTGCAGCTCA CCCCTCAGCT 60 

CCCCTTGGGG CCCAGCTGGG AGCCGAGATA GAAGCTCCTG TCGCCGCTGG GCTTCTCGCC 120 

TCCCGCAGAG GGCCACACAG AGACCGG GAT G GCCACCTCC ATGGGCCTGC TGCTGCTGCT 180 

GCTGCTGCTC CTGACCCAGC CCGGGGCGGG GACGGGAGCT GACACGGAGG CGGTGGTCTG 240 

45 CGTGGGGACC GCCTGCTACA CGGCCCACTC GGGCAAGCTG AGCGCTGCCG AGGCCCAGAA 300 

CCACTGCAAC CAGAACGGGG GCAACCTGGC CACTGTGAAG AGCAAGGAGG AGGCCCAGCA 360 

CGTCCAGCGA GTACTGGCCC AGCTCCTGAG GCGGGAGGCA GCCCTGACGG CGAGGATGAG 420 

CAAGTTCTGG ATTGGGCTCC AGCGAGAGAA GGGCAAGTGC CTGGACCCTA GTCTGCCGCT 4 80 

GAAGGGCTTC AGCTGGGTGG GCGGGGGGGA GGACACGCCT TACTCTAACT GGCACAAGGA 540 

50 GCTCCGGAAC TCGTGCATCT CCAAGCGCTG TGTGTCTCTG CTGCTGGACC TGTCCCAGCC 600 

GCTCCTTCCC AACCGCCTGC CCAAGTGGTC TGAGGGCCCC TGTGGGAGCC CAGGCTCCCC 660 

CGGAAGTAAC ATTGAGGGCT TCGTGTGCAA GTTCAGCTTC AAAGGCATGT GCCGGCCTCT 720 

GGCCCTGGGG GGCCCAGGTC AGGTGACCTA CACCACCCCC TTCCAGACCA CCAGTTCCTC 780 

CTTGGAGGCT GTGCCCTTTG CCTCTGCGGC CAATGTAGCC TGTGGGGAAG GTGACAAGGA 840 

55 CGAGACTCAG AGTCATTATT TCCTGTGCAA GGAGAAGGCC CCCGATGTGT TCGACTGGGG 900 

CAGCTCGGGC CCCCTCTGTG TCAGCCCCAA GTATGGCTGC AACTTCAACA ATGGGGGCTG 960 

CCACCAGGAC TGCTTTGAAG GGGGGGATGG CTCCTTCCTC TGCGGCTGCC GACCAGGATT 1020 

CCGGCTGCTG GATGACCTGG TGACCTGTGC CTCTCGAAAC CCTTGCAGCT CCAGCCCATG 1080 

TCGTGGGGGG GCCACGTGCG TCCTGGGACC CCATGGGAAA AACTACACGT GCCGCTGCCC 1140 

60 CCAAGGGTAC CAGCTGGACT CGAGTCAGCT GGACTGTGTG GACGTGGATG AATGCCAGGA 1200 

CTCCCCCTGT GCCCAGGAGT GTGTCAACAC CCCTGGGGGC TTCCGCTGCG AATGCTGGGT 1260 

TGGCTATGAG CCGGGCGGTC CTGGAGAGGG GGCCTGTCAG GATGTGGATG AGTGTGCTCT 1320 

GGGTCGCTCG CCTTGCGCCC AGGGCTGCAC CAACACAGAT GGCTCATTTC ACTGCTCCTG 1380 

TGAGGAGGGC TACGTCCTGG CCGGGGAGGA CGGGACTCAG TGCCAGGACG TGGATGAGTG 1440 

65 TGTGGGCCCG GGGGGCCCCC TCTGCGACAG CTTGTGCTTC AACACACAAG GGTCCTTCCA 1500 

CTGTGGCTGC CTGCCAGGCT GGGTGCTGGC CCCAAATGGG GTCTCTTGCA CCATGGGGCC 1560 

TGTGTCTCTG GGACCACCAT CTGGGCCCCC CGATGAGGAG GACAAAGGAG AGAAAGAAGG 1620 

GAGCACCGTG CCCCGCGCTG CAACAGCCAG TCCCACAAGG GGCCCCGAGG GCACCCCCAA 1680 

GGCTACACCC ACCACAAGTA GACCTTCGCT GTCATCTGAC GCCCCCATCA CATCTGCCCC 1740 

70 ACTCAAGATG CTGGCCCCCA GTGGGTCCTC AGGCGTCTGG AGGGAGCCCA GCATCCATCA 1800 

CGCCACAGCT GCCTCTGGCC CCCAGGAGCC TGCAGGTGGG GACTCCTCCG TGGCCACACA 1860 

AAACAACGAT GGCACTGACG GGCAAAAGCT GCTTTTATTC TACATCCTAG GCACCGTGGT 1920 

GGCCATCCTA CTCCTGCTGG CCCTGGCTCT GGGGCTACTG GTCTATCGCA AGCGGAGAGC 1980 

GAAGAGGGAG GAGAAGAAGG AGAAGAAGCC CCAGAATGCG GCAGACAGTT ACTCCTGGGT 2040 

75 TCCAGAGCGA GCTGAGAGCA GGGCCATGGA GAACCAGTAC AGTCCGACAC CTGGGACAGA 2100 

CTGCTGAAAG TGAGGTGGCC CTAGAGACAC TAGAGTCACC AG CC AC CATC CTCAGAGCTT 2160 
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TGAACTCCCC ATTCCAAAGG GGCACCCACA 
CAAACAATTG TAAGTCTCCT CCTTAAAGGC 
TGTTTGATGT TCCTGAAGTG GAAGCTGTGT 
TCTATAATGA TTGTTACTCC CCCTCCCTTT 
5 GGGTGTGAGG AGGCTGGGGC TAAGGGGCTC 
ATCTAAGAGG AAAAGGTGAG TTGCTCATGC 
CCTAGGATGA AAACTAAATC AATTAATTAT 
TCAAAGGGAA CATGTTCGGA CTGGAAACAT 
AGCACAAGTC TTGCTAAATG TGATACTGTT 

10 TAACCTCTTA GGTGGCAAGG AGGCAGGAAG 
CTTGGGTTTA TTTGCAAAGG AAGCTTGAAA 
CAGGTGTTTG TGAAGTCACA TAATCTACGG 
CACAGATACT TGAATTAATT CATCCAAATG 
TGTGATCAAC ACTAACAAGG AAACAAATTC 

15 CCTCAGACAC CCTGCCTGTG GCCCCGCCTC 
CGAGCTCAGA CAGAGGAAGC CCTGCAGAAA 
TGAACGGGAG ATGATGCACT GTGTTTTGAA 
TCATAGTCCA CAGTTGATGC AGCATCCTGA 
CACACCAAGT AGGGAGCTAG TCAGGCAGTT 

20 TTCCTTAAAA TTGGGGGTAA GGAGGGAAGG 
TTTTTACAGC AAAAACTGCT CAAAGCCATT 
TTGCAAATAT TTCTCCCTAT GATAATGCAG 
TCTCTCTCAC ACACACACAC ACACACACAC 
CCTGGGGCAC TGGAACACAT TCCTGGGGGT 

25 TGAGTATCTC TGGGAGGCCT CATGTCTCCT 
ACAGACAGAG GAAATGTGTC TCCCTCCAAG 
GGTTTTGCCT TAGCAATGCA TCGGTCTCTG 
CAAGGTGCAG GGTTAATACT CTTGCCAGTT 
TTTTAATAGA AAACTAAAGG GGCAGGGGAA 

30 TCGATGGGGC ATTTGGAACT TCTTTTTAAA 
AACTCTGGTG TTTAACACTT AAGGGAGACA 
GGCCACGAGA CTCTAGGTGA TGTGTGAAGC 
CTGTCTGGCC ATTCAGAGGA TTCTAAAGAC 
CACTTAAATA AATGCAAATG CAACATTTCT 

35 TCATTTGGGG TGAAGGAGAC ATTTCTGTCC 
GTATGATTCC TGGGATCCAA CGAGCCCTCC 
GCCCAGGCCC ATCGTCTGTT CTCTGAATGC 
GAACCCCTCT GTGGAACCCA CAAGGGGAGA 
ACCTTCCCTG GCAGGCTGGG TCCCTCTCCT 

40 CCCACCACGG GGGGAGAGCC AGCAACCCAA 
AACCACTGGG CTCAAACACG TGCTTTATTC 
ATGGAAATTC TTGTTTGGGG GATCTTGGGG 
GCCAAGAGGC CATTAACAAA TCGTCCTTGT 
CACAGTGGGG AATCCAAGGG TCACAGTATG 

45 TCTCGCTAGA CACAGTGTTT CTGCCCAGGT 
CATGGGGACG GGGGAAGTTT TCACTTGGAG 
CCAAATAGGT CAATAATTCT GGGAGACTCT 
TCTCTCCCTC CCCTCATCCC ACATCTCAAA 
CACCCAGCTC GCCATGCCTA CTCATTCCTG 

50 CTTCTTTGTC ATTTGAGAAA GGATGCAGGA 
CAGAAAAACC AGGGCAGGAC AGTTATCGAC 
TAGAGGGACT CCACCCCTGC TCAACAGCTT 
CTCTGCCTTC GGTGGCCCAC ACACCTAAGC 
AACACATCTA CGTGTAGCAC TACGACGTTA 

55 AGGCTCTGAT TAAGGATGTG GGGAAGTGGG 
CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA 
TGCCATCTTC CCTGCGATCA GGCAAAAAAG 
TGTGTTATGT CCATTTTGCA GGATGAACTG 
TTGCTTTGTC TTTTCCATCC TCATCACAAG 

60 TCTTTCGATG GATGGAGATG ATCATTAGGT 
ATTTCTGTGA AAACTAGGAG AACAGAGATG 
TAACACAGTC TTTTTAAAAC TAACATAGGA 
TCTCCATTGT CTAAATCAGG AAAACAGGAA 
TTAATGCCCC CTACATATTT CCATCACCTT 

65 TATGATCCCA GAAAACATCT GTCTCTACTT 
TGGTTTGTGC ATTTTCTCAA CTAAAAATAG 
CTAATCAAAG ACACTATTTT CATACTAGAT 
TTTAAAAATA AATTGTGTTT TGGTCTGTTC 
AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG 

70 TTCAGCAGAT TTTGCCCACT ATTCCTCTGA 
GCTTGAATTA GATCCCTGCA AAGGCTTGCT 
GTAATCACTT CATGAATGCT AAATGAGAAT 
TTTGTTTGAC TAATTCTGGA ATTACAAGAT 
ATGTTTCCCA AACTGTGAGG AGGGAAGGCT 

75 CAAAATGGTG CTTTGAGGGT CAGCCTTTAG 
TCTGTTATGT GCCTATCCTA ATAAACTCTT 



TTTTTTTGAA AGACTGGACT GGAATCTTAG 2220 

CCCTTGGAAC ATGCAGGTAT TTTCTACGGG 2280 

GTTGGCGTGC CACGGTGGGG ATTTCGTGAC 2340 

TCAAATTCCA ATGTGACCAA TTCCGGATCA 2400 

CCCTGAATAT CTTCTCTGCT CACTTCCACC 2460 

TGATTAGGAT TGAAATGATT TGTTTCTCTT 2520 

TCAATTAGGT AAGAAGATCT GGTTTTTTGG 2580 

TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

GACATCCTCC AGAATGGCCA GAAGTGCAAT 2700 

TGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 

AATATGAGAA AAGTTGCTTG AAGTGCATTA 2820 

GGCTAGGGCG AGAGAGGCCA GGGATTTGTT 2880 

TACTGAGGTT ACCACACACT TGACTACGGA 2940 

AAGGACAACC TGTCTTTGAG CCAGGGCAGG 3000 

CACTTCATCC TGCCCGGAAT GCCAGTGCTC 3060 

GTTCCATCAG GCTGTTTCCT AAAGGATGTG 3120 

AGTTGTCATT TTAAAGCATT TTAGCACAGT 3180 

GATTTTAAAT CCTGAAGTGT GGGTGGCGCA 3240 

TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

AAGAGGGAAA GAGATGACTA ACTAAAATCA 3360 

TAAATTATAT CCTCATTTTA AAAGTTACAT 3420 

TCGATAGTGT GCACTCTTTC TCTCTCTCTC 34 80 

ACACACACAC AGAGACACGG CACCATTCTG 3540 

CACCGATGGT CAGAGTCACT AGAAGTTACC 3600 

GTGGGCTTTT TACCACCACT GTGCAGGAGA 3660 

GCCCCAAAGC CTCAGAGAAA GGGTGTTTCT 3720 

AGGTGACACT CTGGAGTGGT TGAAGGGCCA 3780 

TTGAAATATA GATGCTATGG TTCAGATTGT 3840 

GTGAAAGGAA AGATGGAGGT TTTGTGCGGC 3900 

GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 3960 

AAGGCTGTGT CCATTTGGCA AAACTTCCTT 4020 

TGGGCAGTCT GTGGTGTGGA GAGCAGCCAT 4080 

ATGGCTGGAT GCGCTGCTGA CCAACATCAG 4140 

CCCTCTGGGC CTTGAAAATC CTTGCCCTTA 4200 

TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 4260 

TATTTTCACA GTGTTCTGAT TGCTCTCACA 4320 

AGCCCTGTTC TCAACAACAG GGAGGTCATG 4380 

AATGGGTGAT AAAGAATCCA GTTCCTCAAA 4440 

GCTGGGTGGT GCTTTCTCTT GCACACCACT 4500 

CCAGACAGCT CAGGTTGTGC ATCTGATGGA 4560 

TCCTGTTTAT TTTTGCTGTT ACTTTGAAGC 4620 

CTACAGTAGT GGGTAAACAA ATGCCCACCG 4680 

CCTGAGGGGC CCCAGCTTGC TCGGGCGTGG 4740 

GGGAGAGGTG CACCCTGCCA CCTGCTAACT 4800 

GACCTGTTCA GCAGCAGAAC AAGCCAGGGC 4860 

ATGGACACCA AGACAATGAA GATTTGTTGT 4920 

TGGAAAAAAC TGAATATATT CAGGACCAAC 4980 

GCAGACAATG TAAAGAGAGA ACATCTCACA 5040 

AATTTCAGGT GCCATCACTG CTCTTTCTTT 5100 

GGACAATTCC CACAGATAAT CTGAGGAATG 5160 

AATGCATTAG AACTTGGTGA GCATCCTCTG 5220 

GGCTTCCAGG CAAGACCAAC CACATCTGGT 5280 

GTCATCGTCA TTGCCATAGC ATCATGATGC 5340 

TGTTTGGGTA ATGTGGGGAT GAACTGCATG 5400 

CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

GGAGCAAGGC TTCAGGAAGG GCCAGCCACA 5520 

TGGAATTAAA AAGTCAAACC TTTATATGCA 5580 

AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

CCCTTGTTTG AGTGTCTTAT CCCTGAGCAA 5700 

ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

AGATTTGACA AAAAAAAATT GAATTAAAAA 5820 

AAGCCTTTCC TATTATTTCT CTTCTTAGCT 5880 

AACACAGCTT TCTAGCAGCT GCAAAATGGT 5940 

GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

CGGCTGCAAA ACCCATGGTT TAAATCTATA 6060 

AGATGATAAT CCGAATTCTC CATATATTCA 6120 

TCCTGAGACA AATACTCACT GAAGGGCTTG 6180 

TTGTAGATAA TGCCCTTCTA TTTTAGGTAG 6240 

CTCTTATCTG CAAGGTGGCA AGCAGTTCTT 6300 

GCTGAAGTTC TTTGCATAGA TTTGGCTTAA 6360 

CTGTGATGTC AGATGTAATT GTAAATGTCA 6420 

GTAAGTATTT TTAAATGTGT GTATTTCAAA 6480 

TTCTATGCAG GATTTACCTT CATCCTGTGC 6S40 

CAGAGATCGA GCTTCTCCTC TGAGTTCTAA 6600 

GAAGGTGCAG CTTTGTTGTC CTTTGAGCTT 6660 
AAACACATT 
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Seq ID No: 131 Protein sequence: 
Protein Accession #: NP_036204.1 
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MATSMGLLLL LLLLLTQPGA GTGADTEAW CVGTACYTAH SGKLSAAEAQ NHCNQNGGNL 60 

ATVKSKEEAQ HVQRVLAQLL RREAALTARM SKFWIGLQRE KGKCLDPSLP LKGFSWVGGG 120 

EDTPYSNWHK ELRNSCISKR CVSLLLDLSQ PLLPNRLPKW SEGPCGSPGS PGSNIEGFVC 180 

KFSFKGMCRP LALGGPGQVT YTTPFQTTSS SLEAVPFASA ANVACGEGDK DETQSHYFLC 240 

KEKAPDVFDW GSSGPLCVSP KYGCNFNNGG CHQDCFEGGD GSFtiCGCRPG FRLLDDLVTC 300 

ASRNPCSSSP CRGGATCVLG PHGKNYTCRC PQGYQLDSSQ LDCVDVDECQ DSPCAQECVN 360 

TPGGFRCECW VGYEPGGPGE GACQDVDECA LGRSPCAQGC TNTDGSFHCS CEEGYVLAGE 420 

DGTQCQDVDE CVGPGGPLCD SLCFNTQGSF HCGCLPGWVL APNGVSCTMG PVSLGPPSGP 480 

PDEEDKGEKE GSTVPRAATA SPTRGPEGTP KATPTTSRPS LSSDAPITSA PLKMLAPSGS 540 

SGVWREPSIH HATAASGPQE PAGGDSSVAT QNNDGTDGQK LIjLFYILGTV VAILLLLALA 600 
LGLLVYRKRR AKREEKKEKK PQNAADSYSW VPERAESRAM ENQYSPTPGT DC 



Seq ID NO: 132 DNA sequence 

Nucleic Acid Accession #: NMJ)00963.1 

Coding sequence: 135-1949 (underlined sequences correspond to start and stop codons) 
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GACTTTTATG TCACTACTCT AAAGATTTTG 



TCAGCAGCGC 


60 


CGCCGCGCCC 


120 


GCGCTCAGCC 


180 


ATGAGTGTGG 


240 


AACTGCTCAA 


300 


ACAGTGCACT 


360 


TTCCTTCGAA 


420 


CCACCAACTT 


480 


TATTATACTA 


540 


GGTAAAAAGC 


600 


TTCATCCCTG 


660 


CATCAGTTTT 


720 


GGGGTGGACT 


780 


TTCAAGGATG 


840 


AAAGATACTC 


900 


GTGGGGCAGG 


960 


CGGGAACACA 


1020 


CAGTTGTTCC 


1080 


GATTATGTGC 


1140 


TTCAACAAAC 


1200 


TGGCATCCCC 


1260 


TTTATCTACA 


1320 


ACCAGGCAAA 


1380 


GTATCACAGG 


1440 


CGCAAACGCT 


1500 


ATGTCTGCAG 


1560 


CTTCTGGTAG 


1620 


GCACCATTCT 


1680 


CCAAGCACTT 


.1740 


CTCATCTGCA 


1800 


CTCATTAAAA 


1860 


CCCACAGTAC 


1920 


TTATTTATAT 


1980 


TATGTTACTT 


2040 


TACTTGTGAA 


2100 


AAACAGTTTT 


2160 


GAATTTCAAC 


2220 


CAAGATGGCA 


2280 


TGCATTAGAA 


2340 


AAACAAAAAC 


2400 


TTTCATGTCT 


2460 


GGGTAGAATC 


2520 


CCAAAAAGAA 


2580 


CTTGTTAAAA 


2640 


GTGACTGTTA 


2700 


TGCAGTGTTA 


2760 


CTGGTAACAT 


2820 


AGAAGAAAAC 


2880 


ATCCTTGTGC 


2940 
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10 



15 



20 



25 



ACTGCAGGCC 
ATAACGATAT 
TTGCAAAAGT 
CATTAATTTT 
ACCTGCATGC 
CTCATCACTT 
TGGACTCTGC 
AGGACTGCTA 
AATAGTATAC 
ATCTGTAACC 
TGACTGGAAA 
CAAATGATAT 
ATCTCATTGT 
TTTATGTCTT 
TTTTTTGTTA 
AACAATCCAA 
CATATTGAGA 
AAAGAATATT 
GGGATCTGTG 
TCAAGCACTG 
TATAAATAAT 
AAAGATAACT 
AGAAATAGTC 
GATTTGTTAT 
GTTTCAGTGC 
TGTCTGTTTA 



TGGTACTCAG 
GTTTTCTCAG 
AGCAATGACC 
ATCTCAGTCT 
TGTTCCTTTT 
CGTTTCTCCT 
CTATATTTTC 
TTTAGCTCCT 
ACTTATTTTA 
AAGATGGATG 
AAGTTACGTT 
CTAAGTAGTT 
CACTGACATT 
ATTAGGACAC 
TGTCACAATC 
AGAAATGATT 
TATTTAAGGT 
GTCTCATTAG 
GATGCTTCGT 
TGGGTTTTAA 
TGAAAAAAAT 
CAGGAGAATC 
AATATGCTTG 
TAACATTGAT 
CTCAGACAAA 
TTTTTGTACT 



ATTTTGCTAT 
ATTTTCTGTT 
TCATAAAATA 
TGAAGCCAAT 
CTTTTCTTCT 
ATTTTGTTTT 
TTACCTGAAC 
CTTAAGAAGA 
AGTGAAAAGC 
CAAAGAGGCT 
CCCATTCTAA 
CTCAGCAATA 
TAATGGTACT 
TATGGTTATA 
AGTATATTTT 
GTATTAAGAT 
TGAATGTTTG 
CCTGAATGTG 
TAATTTGTTC 
TATTTTTAAA 
TTTCTTTTGG 
TTCTTTACAA 
TATAAAACAC 
CTGCTGACAA 
TGTGTATTTA 
ATTTA 



GAGGTTAATG 
GTACAGTTTA 
CCTCTTCAAA 
TCAGTAGGTG 
TTTAGCCATT 
ACTAGTTTTA 
TTTTGCAAGT 
TTAAAAGAGA 
AGAGAATTTT 
AGTGCCTCAG 
TTAATGCCCT 
ATAATAATGA 
GTATATTACT 
AACTGTGTTT 
CTTTGGGGTT 
TTGTGAATAA 
TCCTTAGGAT 
CCATAAGACT 
AGCCACAATT 
TCAAACGCTG 
GAAGAGGGAG 
TTTTACGTTT 
TGTTCACTGT 
AACCTGGGAA 
ACTTATGTAA 



AAGTACCAAG 
ATTTAGCAGT 
ATGCTTAAAT 
CATTGGAATC 
TTGCTAAGAG 
AGATCAGAGT 
TTTCAGGTAA 
AAAAAAAAGG 
ATTTATAGCT 
AGAGAACTGT 
TTCTTATTTA 
CGATAATACT 
TAATTTATTG 
AAGCCTACAA 
ACCTCTCTGA 
ATTTTTAGAA 
AGGCCTATGT 
GACCTTTTAA 
TATTGAGAAA 
ATTACAGATA 
AAAATGAAAT 
AGAATGTTTA 
TTTTTTTAAA 
TTTGGGTTGT 
AAGATAAGTC 



CTGTGCTTGA 
CCATATCACA 
TCATTTCACA 
AAGCCTGGCT 
ACACAGTCTT 
TCACTTTCTT 
ACCTCAGCTC 
CCCTTTTAAA 
AATTTTAGCT 
ACGGGGTTTG 
AAAACAAAAC 
TCTTTTCCAC 
AAGATTATTA 
TCATTGATTT 
ATATTATGTA 
ATCTGATTGG 
GCTAGCCCAC 
AATGTTTTGA 
ATATTCTGTG 
ATAGTATTTA 
AAATATCATT 
AGGTTAAGAA 
AAAAAAACTT 
GTATGCGAAT 
TGGAAATAAA 



3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 



30 



Seq ID No: 133 Protein sequence ; 
Protein Accession ft: NP 000954.1 



35 



40 



45 



50 



MIiARALLLCA 
TRIKLFLKPT 
GYKSWEAFSN 
NMMFAFFAQH 
QIIDGEMYPP 
VLKQEHPEWG 
NRIAAEFNTL 
AGGRNVPPAV 
YGDIDAVELY 
GFQIINTASI 
STEh 



11 
I 

VLALSHTANP 
PNTVHYILTH 
LSYYTRALPP 
FTHQFFKTDH 
TVKDTQAEMI 
DEQLFQTSRL 
YHWHPLLPDT 
QKVSQASIDQ 
PALLVEKPRP 
QSLICNNVKG 



21 
I 

CCSHPCQNRG 
FKGFWNWNN 
VPDDCPTPLG 
KRGPAFTNGL 
YPPQVPEHLR 
1LIGETIKIV 
FQIHDQKYNY 
SRQMKYQSFN 
DAIFGBTMVE 
CPFTSFSVPD 



31 
I 

VCMSVGFDQY 
IPFLRNAIMS 
VKGKKQLPDS 
GHGVDLNHIY 
FAVGQEVFGL 
IEDYVQHLSG 
QQFIYNNSIL 
EYRKRFMLKP 
VGAPFSLKGL 
PELIKTVTIN 



41 
I 

KCDCTRTGFY 
YVLTSRSHLI 
NEIVEKLLLR 
GETLARQRKL 
VPGLMMYATI 
YHFKLKFDPE 
LEHGITQFVE 
YESFEELTGE 
MGNVICSPAY 
ASSSRSGLDD 



51 

I 

GENCSTPEFIi 
DSPPTYNADY 
RKFIPDPQGS 
RLFKDGKMKY 
WLREHNRVCD 
LLFNKQFQYQ 
SFTRQIAGRV 
KEMSAELEAIi 
WKPSTFGGEV 
INPTVUjKER 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 134 DNA sequence 

Nucleic Acid Accession ft: XM_05964 8.1 

Coding sequence: 35-664 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



55 



60 



65 



70 



75 



AGGCTGCTGA 
AAAATGTTCC 
AGTAACGCTA 
CTTTGAAGTG 
TTCACTAGTT 
GAAGGAACTG 
TCAGTTTGGA 
CTACGGAGTA 
TGCATTTAGA 
TCTTGTCAAC 
CATCAGGCCT 
ATGAGAATGC 
ATTTTAAACA 
TAGTGCGTTC 
TGAAGGATTT 
GGATGTTACC 
ACTTCCTCTG 
TATTGTTCAA 
ACTCTTTGGA 
ATGTTAATGG 
TCCTTATATA 
TTCCTAGTGT 
TATAAATGAC 



GACTTCCCTC 
GGGCCCAGAG 
TTTCTTCTAC 
AAGGATGCAA 
GTAAACGTGG 
CACAAAGAGT 
GAATCGGAGC 
ACTTTCCCCA 
TTTCTTGTTG 
CCTGAGGGTC 
GACATAGCAG 
CATTGCGTTT 
TTTTTTTTTT 
TCAGCTCATT 
TTTTTTAATG 
CAAAGCAAAA 
ACCATACTAA 
CTTGACATTT 
TTCAAGAGCA 
AATAAAACAC 
TGTATGCTTG 
TTTACTATAA 
CGTATTATAA 



TAGAATCCTC 
CAAAGGTATT 
AACTAAAATT 
AAGGAAGAAC 
CCAGTGACTG 
TTGGACCATC 
CCCGCCCAAG 
TCTTCCACAA 
ATTCTTCAAA 
AAGTTGTGAA 
CTCTGGTTAG 
CTAATAGAAC 
GGAGACAGTG 
GCAACCTCTG 
TTATCTTGCT 
ATCAAGAGTA 
AGAATTCAGA 
TCTAGGACTG 
CTGTGTATGA 
AAATGTTGAA 
TTTTGTGTAC 
CTGTCACTAT 
CA 



CAACATGGAG 
TGCAGTTTTG 
CCTCAAACCT 
TGTTTCTCTG. 
CCAACTCACA 
CCACTTCAGC 
CAAGGAAGTA 
GATTAAGATT 
GAAGGAACCA 
GTTCTGGAAG 
ACAAGTGATC 
AGAGAAATGT 
TCTCACTCTG 
CCTTTTTAAA 
ATTAAGTGGT 
GCCAAAGAAT 
ATACACAGTG 
TACTTGATGA 
CTGAAATTTC 
AAATGTAAAA 
AGGATTTTGT 
GTATGTAACT 



CCTCTTGCAG 
CTGTCTATAG 
AAAATCAACA 
GAAAAGTATA 
GACAGAAATT 
GTGTTGGCTT 
GAATCTTTTG 
CTAGGATCTG 
AGGTGGAATT 
CCAGAGGAGC 
ATAAAAAAGA 
CTCCATGAGG 
TCACCCAGGC 
CATGCTATTA 
AATGAATGTT 
CAACATGAAA 
ACCAATGTGC 
AAATGCCAAC 
TGGAATAACT 
TATATATACA 
TTTTTCTTTT 
GACATATATA 



CTTACCCGCT 
TTCTATGCAC 
GCTTTTATGC 
AAGGCAAAGT 
ACTTAGGGCT 
TTCCCTGCAA 
CAAGAAAAAA 
AAGGAGAACC 
TTTGGAAGTA 
CCATTGAAGT 
AAGAGGATCT 
GTTTGGTCTC 
TGGAGTGCAG 
AATGTGGCAA 
CCCAGGATGA 
TATATTAACT 
CTCAATATCT 
ACACTAGACC 
GTAAATGGTT 
TAGATTCAAA 
TAAGTACAGG 
AATAGTCATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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Seq ID No: 135 Protein Beouence: 
Protein Accession #: XP_059648.1 



1 11 21 31 41 51 

I I I I I I 

MEPIAAYPLK CSGPRAKVFA VLLSIVLCTV TLFLLQLKFL KPKINSFYAF EVKDAKGRTV 60 
SLEKYKGKVS LWNVASDCQ LTDRNYLGLK ELHKEFGPSH FSVLAFPCNQ FGESBPRPSK 120 
EVESFARKNY GVTFPIFHKI KILGSEGEPA FRFLVDSSKK EPRWNFWKYIi VNPEGQWKF 180 
WKPEEPIEVI RPDIAALVRQ VIIKKKEDL 



Seq ID NO: 136 DNA sequence 

Nucleic Acid Accession #: NM_003 003.1 

Coding sequence: 304-2451 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I i 

CAAGTGCCGT CGCCGCGCCC CTTCCCCCTC CCGCCTCCCC GGCCCCCTCC CCGGAACCGG 60 

CGGTCGAGCT ACGGTCGCGG ACGAGTGGAA CCGAGACTGC CCCGCGGAGC CGCCGGTATG 120 

AGCGCCCCTC GCCACCCCGT GTCCCAGGCC CGGCCTTTCT GACAAGAGCT AGACTTCGGG 180 

CTCCTTGAGG ATATTCAGTT TTGTATGTTT GAATATCCTC TCACCATGTT CAGCATAAAG 240 

TACCATTCTT AATGATTATC CTCAACAAGA CAGGTGTGAG AGGGTTGCTG TTGCATTGCA 300 

ATCATGGTGC AAAAATACCA GTCCCCAGTG AGAGTGTACA AATACCCCTT TGAATTAATT 360 

ATGGCTGCCT ATGAAAGGAG GTTCCCTACA TGTCCTTTGA TTCCGATGTT CGTGGGCAGT 420 

GACACTGTGA GTGAATTCAA GAGCGAAGAT GGGGCTATTC ATGTCATTGA AAGGCGCTGC 480 

AAGCTGGATG TAGATGCACC CAGACTGCTG AAGAAGATTG CAGGAGTTGA TTATGTTTAT 540 

TTTGTCCAGA AAAACTCACT GAATTCTCGG GAACGTACTT TGCACATTGA GGCTTATAAT 600 

GAAACGTTTT CCAATCGGGT CATCATTAAT GAGCATTGCT GCTACACCGT TCACCCTGAA 660 

AATGAAGATT GGACCTGTTT TGAACAGTCT GCAAGTTTAG ATATTAAATC TTTCTTTGGT 720 

TTTGAAAGTA CAGTGGAAAA AATTGCAATG AAACAATATA CCAGCAACAT TAAAAAAGGA 780 

AAGGAAATCA TCGAATACTA CCTTCGCCAA TTAGAAGAAG AAGGCATAAC CTTTGTGCCC 840 

CGTTGGAGTC CGCCTTCCAT CACGCCCTCT TCAGAGACAT CTTCATCATC CTCCAAGAAA 900 

CAAGCAGCGT CCATGGCCGT CGTCATCCCA GAAGCTGCCC TCAAGGAGGG GCTGAGTGGT 960 

GATGCCCTCA GCAGCCCCAG TGCACCTGAG CCCGTGGTGG GCACCCCTGA CGACAAACTA 1020 

GATGCCGACC ACATCAAGAG ATACCTGGGC GATTTGACTC CGCTGCAGGA GAGCTGCCTC 1080 

ATTAGACTTC GCCAGTGGCT CCAGGAGACC CACAAGGGCA AAATTCCAAA AGATGAGCAT 1140 

ATTCTTCGGT TCCTCCGTGC ACGGGATTTT AATATTGACA AAGCCAGAGA GATCATGTGT 1200 

CAGTCTTTGA CGTGGAGAAA GCAGCATCAG GTAGACTACA TTCTTGAAAC CTGGACCCCT 1260 

CCTCAGGTCC TTCAGGATTA CTACGCGGGA GGCTGGCATC ATCACGACAA AGATGGGCGG 1320 

CCCCTCTACG TGCTCAGGCT GGGGCAGATG GACACCAAAG GCTTGGTGAG AGCGCTCGGG 1380 

GAGGAAGCCC TGCTGAGATA CGTTCTCTCC GTAAATGAAG AACGGCTAAG GCGATGCGAA 1440 

GAGAATACAA AAGTCTTTGG TCGGCCTATC AGCTCATGGA CCTGCCTGGT GGACTTGGAA 1500 

GGGCTGAACA TGCGCCACTT GTGGAGACCT GGTGTGAAAG CGCTGCTGCG GATCATCGAG 1560 

GTGGTGGAGG CCAACTACCC TGAGACACTG GGCCGCCTTC TCATCCTGCG GGCGCCCAGG 1620 

GTATTTCCTG TGCTCTGGAC GCTGGTTAGT CCGTTCATTG ATGACAACAC CAGAAGGAAG 1680 

TTCCTCATTT ATGCAGGAAA TGACTACCAG GGTCCTGGAG GCCTGCTGGA TTACATCGAC 1740 

AAAGAGATTA TTCCAGATTT CCTGAGTGGG GAGTGCATGT GCGAAGTGCC AGAGGGTGGA 1800 

CTGGTCCCCA AATCTCTGTA CCGGACTGCA GAGGAGCTGG AGAACGAAGA CCTGAAGCTC 1860 

TGGACTGAGA CCATCTACCA GTCTGCAAGC GTCTTCAAAG GAGCCCCACA TGAGATTCTC 1920 

ATTCAGATTG TGGATGCCTC GTCAGTCATC ACTTGGGATT TCGACGTGTG CAAAGGGGAC 1980 

ATTGTGTTTA ACATCTATCA CTCCAAGAGG TCGCCACAAC CACCCAAAAA GGACTCCCTG 2040 

GGAGCCCACA GCATCACCTC TCCGGGTGGG AACAATGTGC AGCTCATAGA CAAAGTCTGG 2100 

CAGCTGGGCC GCGACTACAG CATGGTGGAG TCGCCTCTGA TCTGCAAAGA AGGAGAAAGC 2160 

GTGCAGGGTT CCCATGTGAC CAGGTGGCCG GGCTTCTACA TCCTGCAGTG GAAATTCCAC 2220 

AGCATGCCTG CGTGCGCCGC CAGCAGCCTT CCCCGGGTGG ACGACGTGCT TGCGTCCCTG 2280 

CAGGTCTCTT CGCACAAGTG TAAAGTGATG TACTACACCG AGGTGATCGG CTCGGAGGAT 2340 

TTCAGAGGTT CCATGACGAG CCTGGAGTCC AGCCACAGCG GCTTCTCCCA GCTGAGTGCC 2400 

GCCACCACCT CCTCCAGCCA GTCCCACTCC AGCTCCATGA TCTCCAGGTJ^GTGCCGCGCT 2460 

GCCTGCACCT AGTGTGCAGA GGGGACGGCC GCCCCTCCTC GGACAGCAGC TGCACCCGCC 2520 

CACCCAGCGG CGACATTGTA CAGACTCCTC TCACCTCTAG ATAGCAAATA GCTCTCAGAT 2580 

GGTAAACGTA GTCGTTTGAT CCCAAAACTA CCTTGGCAGG TAGTTTTAAC TCTGATCCTA 2640 

ACTTAACTCA ATAGCCATAG ATTTTGTATA CGTTGTGCAC AAAATCCAAC CAGAGCGCAA 2700 

GGGCTCTCTT GAAAGAAAAG TAGTTTCTGT ACCAATTAAA GGATTGACGT GGTCTCAGAT 2760 

ATTGATGCAA AAAATTTTTC CAACGAACTC CGCATTGTCC ATTAGTGAAT GAATTCCTGT 2820 

GACATCCTCC AGAGATGGCC CCTCCTCACC TGGGACGGAA GCTGCCAGCT CGCTTCCCCC 2880 

AAGCTGCCTC ATGGCCCGCA CGCCGCCTCA CGGCCCCCAT GCTTCCCGCC AGTCAAGATG 2940 

GTCTGTGGAC TTAGGGCCAG CCCTTGAGGT CCTTATCCTC TGAGGATTCA GAGGTTGCCT 3000 

GOGGAGTACC TTGTCCCAGG GCCAGACACA CCCACACCAC CCACTGTCTG CAGTGGGGCC 3060 

GGGGGCTCAG GAGGGGCTCT CAGGGACTCC TGGTGACTCC AGGAAAATGC TGCCATCGTT 3120 

AAACATTACT TTCTCTTTCC TCCTTTTCAA ATCTTTTTGA TACTTTTTAG AGCAGGATTT 3180 

TTCTGTATGT GAACTTGGGT GGGGGGGTTC TTCCCGTTTC CTTCCGTGCG TCGCCCCTCT 3240 

CACCTGCAGT CAGCTCCCAG CCCAGTGTAG GCCATCTCCT CTGTGCCCTC TGGAGGCTCA 33 00 

TTGTCTCAGA GCCCAGACAG TTCCAGCCAC TAGGAGGCCG TCTTGGAACC AGCAAGTCGC 3360 

ATTTGCCACT TGACACTGTC CATGGGGTTT TATTAGTAGC TAAGCAGCAG CTCTCGCATC 3420 
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CACTTCAGGG TGGCGTGTGG CATGTAGGAG TCCTGCTTCT TTGTACATGG GAATTGTGGA 3480 

CTCATGCGTG TGTGTGTGTG CATGTGCTGT GTGTGTGCAT GTGTGCATGA CGGTGGGGGT 3540 

GCTGGGGGGA CGGGGTGAGT GGAAACTTAG TTTGAGTAAT GAAGGAATCT TCACAGAAGC 3600 

AAATCAGAAT ATGGGATTTG TTTGCCTTTT ACATTTTGTT TAATTCCTGA TTTTAAAGCC 3660 

TGCTCTATCT GGTACAGGCC CTTATTTTTT CAGCTTTTTA TGGGAAAAGC AGGTTATTTG 3720 

AGAATCTGTC CAGAAGTTGC ATAGGGGATG GCCTCCACGA TAAGGACATG CAACACGTGT 3780 

TTCTGTGTGC AGCAGAGGCC GTGTTTTTCA TGCCAAACCC CACGCGGCTG TCAACTGTGT 3840 

GCGTGGTAGG CATGGAGATC CTGGTTGTGC CGTCTCAGCT CCGCTCTGAA GGCACTGTGT 3900 

GGGTGCTGCG TGACTGGAGA GCTGTGTGGA GGCCATGTGT GCCCCGTGCA GGGATCAGGA 3960 

GGGCGGGGGA GGGACCGAGC AGCCCTCTTG CCCGGTCGGG TCAGCCCTAG TGGCTGCCTG 4020 

CACACTGTAG ACGTCCCAGG GCCTGTGCTG TGATCACCTG CCTTTGGACC ACATTTGTGT 4080 

TTGCTCTTAG AGATCGAGCT CCTCAGTGGT ACCTGAAGCC TTTGCTTCCG GAAAGCGCGG 4140 

TAGGGTTCGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT AGGGCTAGTA GGTAGGGCTA 4200 

GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG GTAGGGCTGG TAGGTAGGGT TAGTAGGTAG 4260 

GGCTAGTAGG TAGGGTTCGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT AGGGCTAGTA 4320 

GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG GTAGGGCTGG TAGGTAGGGT 4380 

TAGTAGGTAG GGCTAGTAGG TAGGGTTCGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT 4440 

AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG GTAGGGCTGG 4500 

TAGGTAGGGT TAGTAGGTAG GGCTAGTAGG TAGGGCTAGT AGGTAGGGCT AGTAGGTAGG 4560 

GTTAGTAGGT AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG 4620 

GTAGGGCTGG TAGGTAGGGT TAGTAGGTAG GGCTAGTAGG TAGGGCTAGT AGGTAGGGCT 4680 

AGTAGGTAGG GCTAGTAGGT AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG CTAGTAGGTA 4740 

GGGTTCGTAG GTAGGGTTCG TAGGTAGGGT TCGTAGGTAG GGTTAGTAGC GCGTCTGTGC 4800 

TGCTTCCACC TGGTGCTTCC TGTTCCCAAA TCACAAGGGC CTGAAGGTGG TCCCTGCTTT 4860 

CTCTTTCTCT TTCTCTGTGT CTCAGATGGC GATTTTGCTG ACAGCTGCCA AGAAAATGCT 4920 

TCACTCAACA GTCCTCATGT GCCCAGAGAT GTTTATAGAA CTGTTTGAAT TGCAGCCATC 4980 

CCCTGCCCCC TCCCAGGCTG AAGATCTGTT CTTTTTAAGT TGATTCGGGA GTGGCATTCT 5040 

TTTATACCCA AAGACTGTAG TGCATCTTGA AGAGCTCAAA GCACATGACC GCACAAATGC 5100 

TTACAGGGTT TCCTCCCGAG TAATCCAATC TCACTCCCCT TGTAAGGGAA TTCTGGGGCA 5160 

GCTATGGTTT GAGTATGCAG TTTGCATCGT GTTTCTACCT TTAGTACCTT GCCACTCTTT 5220 

TAAAACGCTG CTGTCATTTC CCATTTCTTA GTACTAATGA TTCTTTGATT CTCCCTCTAT 5280 

TATGTCTTAA TTCACTTTCC TTCCTAAATT TGTTATTTGC ATATCAAATT CTGTAAATGT 5340 

TTTGTAAACA TATTACCTCA CTTGGTAATA CAATACTGAT AGTCTTTAAA AGATTTTTTT 5400 
ATTGTTATCA ATAATAAATG TGAACTATTT AAAG 

Seq ID No: 137 Protein sequence: 
Protein Accession #: NP 002994.1 



1 U 21 31 41 51 

I I I I I I 

MVQKYQSPVR VYKYPFELIM AAYERRFPTC PLIPMFVGSD TVSEFKSEDG AIHVIERRCK 60 

LDVDAPRLLK KIAGVDYVYF VQKNSLNSRE RTLHIEAYNE TFSNRVIINE HCCYTVHPEN 120 

EDWTCFEQSA SLDIKSFFGF ESTVEKIAMK QYTSNIKKGK EIIEYYLRQL EEEGITFVPR 180 

WSPPSITPSS ETSSSSSKKQ AASMAWIPE AALKEGLSGD ALSSPSAPEP WGTPDDKLD 240 

ADHIKRYLGD LTPLQESCLI RLRQWLQETH KGKIPKDEHI LRFLRARDFN IDKAREIMCQ 300 

SLTWRKQHQV DYILETWTPP QVLQDYYAGG WHHHDKDGRP LYVLRLGQMD TKGLVRALGE 360 

EALLRYVLSV NEERLRRCEE NTKVFGRPIS SWTCLVDLEG LNMRHLWRPG VKALLRIIEV 420 

VEANYPETLG RLLILRAPRV FPVLWTLVSP FIDDNTRRKF LIYAGNDYQG PGGLLDYIDK 480 

EIIPDFLSGE CMCEVPEGGL VPKSLYRTAE ELENEDLKLW TETIYQSASV FKGAPHEILX 540 

QIVDASSVIT WDFDVCKGDI VFNIYHSKRS PQPPKKDSLG AHSITSPGGN NVQLIDKVWQ 600 

LGRDYSMVES PLICKEGESV QGSHVTRWPG FYILQWKFHS MPACAASSLP RVDDVLASLQ 660 
VSSHKCKVMY YTEVIGSEDF RGSMTSLESS HSGFSQLSAA TTSSSQSHSS SMISR 

Seq ID NO: 138 DNA sequence 

Nucleic Acid Accession #: NMJJ04181.1 

Coding sequence: 32-670 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

1*1 I I I I cn 

GCAGAAATAG CCTAGGGAGA TCAACCCCGA G ATG CTGAAC AAAGTGCTGT CCCGGCTGGG 60 

GGTCGCCGGC CAGTGGCGCT TCGTGGACGT GCTGGGGCTG GAAGAGGAGT CTCTGGGCTC 120 

GGTGCCAGCG CCTGCCTGCG CGCTGCTGCT GCTGTTTCCC CTCACGGCCC AGCATGAGAA 180 

CTTCAGGAAA AAGCAGATTG AAGAGCTGAA GGGACAAGAA GTTAGTCCTA AAGTGTACTT 240 

CATGAAGCAG ACCATTGGGA ATTCCTGTGG CACAATCGGA CTTATTCACG CAGTGGCCAA 300 

TAATCAAGAC AAACTGGGAT TTGAGGATGG ATCAGTTCTG AAACAGTTTC TTTCTGAAAC 360 

AGAGAAAATG TCCCCTGAAG ACAGAGCAAA ATGCTTTGAA AAGAATGAGG CCATACAGGC 420 

AGCCCATGAT GCCGTGGCAC AGGAAGGCCA ATGTCGGGTA GATGACAAGG TGAATTTCCA 4 80 

TTTTATTCTG TTTAACAACG TGGATGGCCA CCTCTATGAA CTTGATGGAC GAATGCCTTT 540 

TCCGGTGAAC CATGGCGCCA GTTCAGAGGA CACCCTGCTG AAGGACGCTG CCAAGGTGTG '600 

CAGAGAATTC ACCGAGCGTG AGCAAGGAGA AGTCCGCTTC TCTGCCGTGG CTCTCTGCAA 660 

GGCAGCCTAA TGCTCTGTGG GAGGGACTTT GCTGATTTCC CCTCTTCCCT TCAACATGAA 720 

AATATATACC CCCCATGCAG TCTAAAATGC TTCAGTACTT GTGAAACACA GCTGTTCTTC 780 

TGTTCTGCAG ACACGCCTTC CCCTCAGCCA CACCCAGGCA CTTAAGCACA AGCAGAGTGC 840 

ACAGCTGTCC ACTGGGCCAT TGTGGTGTGA GCTTCAGATG GTGAAGCATT CTCCCCAGTG 900 
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TATGTCTTGT ATCCGATATC TAACGCTTTA AATGGCTACT TTGGTTTCTG TCTGTAAGTT 960 
AAGACCTTGG ATGTGGTTAT GTTGTCCTAA AGAATAAATT TTGCTGATAG TAGC 



Seq ID No: 139 Protein sequence: 
Protein Accession #: NPJ)04172.1 

1 11 21 31 41 51 

I I I I I I 

MLNKVLSRLG VAGQWRFVDV LGLBEESLGS VPAPACALLL LFPLTAQHEN FRKKQIEELK 60 
GQEVSPKVYF MKQTIGNSCG TIGLIHAVAN NQDKLGFEDG SVIiKQFLSET EKMSPEDRAK 120 
CFEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILFNNVDGH LYELDGRMPF PVNHGASSED 180 
TLLKDAAKVC REFTEREQGE VRFSAVALCK AA 

Seq ID NO: 140 DNA sequence 

Nucleic Acid Accession #: NM__000201.1 

Coding sequence: 58-1656 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GCGCCCCAGT CGACGCTGAG CTCCTCTGCT ACTCAGAGTT GCAACCTCAG CCTCGCT ATG 60 

GCTCCCAGCA GCCCCCGGCC CGCGCTGCCC GCACTCCTGG TCCTGCTCGG GGCTCTGTTC 120 

CCAGGACCTG GCAATGCCCA GACATCTGTG TCCCCCTCAA AAGTCATCCT GCCCCGGGGA 180 

GGCTCCGTGC TGGTGACATG CAGCACCTCC TGTGACCAGC CCAAGTTGTT GGGCATAGAG 240 

ACCCCGTTGC CTAAAAAGGA GTTGCTCCTG CCTGGGAACA ACCGGAAGGT GTATGAACTG 300 

AGCAATGTGC AAGAAGATAG CCAACCAATG TGCTATTCAA ACTGCCCTGA TGGGCAGTCA 360 

ACAGCTAAAA CCTTCCTCAC CGTGTACTGG ACTCCAGAAC GGGTGGAACT GGCACCCCTC 420 

CCCTCTTGGC AGCCAGTGGG CAAGAACCTT ACCCTACGCT GCCAGGTGGA GGGTGGGGCA 480 

CCCCGGGCCA ACCTCACCGT GGTGCTGCTC CGTGGGGAGA AGGAGCTGAA ACGGGAGCCA 540 

GCTGTGGGGG AGCCCGCTGA GGTCACGACC ACGGTGCTGG TGAGGAGAGA TCACCATGGA 600 

GCCAATTTCT CGTGCCGCAC TGAACTGGAC CTGCGGCCCC AAGGGCTGGA GCTGTTTGAG 660 

AACACCTCGG CCCCCTACCA GCTCCAGACC TTTGTCCTGC CAGCGACTCC CCCACAACTT 720 

GTCAGCCCCC GGGTCCTAGA GGTGGACACG CAGGGGACCG TGGTCTGTTC CCTGGACGGG 780 

CTGTTCCCAG TCTCGGAGGC CCAGGTCCAC CTGGCACTGG GGGACCAGAG GTTGAACCCC 840 

ACAGTCACCT ATGGCAACGA CTCCTTCTGG GCCAAGGCCT CAGTCAGTGT GACCGCAGAG 900 

GACGAGGGCA CCCAGCGGCT GACGTGTGCA GTAATACTGG GGAACCAGAG CCAGGAGACA 960 

CTGCAGACAG TGACCATCTA CAGCTTTCCG GCGCCCAACG TGATTCTGAC GAAGCCAGAG 1020 

GTCTCAGAAG GGACCGAGGT GACAGTGAAG TGTGAGGCCC ACCCTAGAGC CAAGGTGACG 1080 

CTGAATGGGG TTCCAGCCCA GCCACTGGGC CCGAGGGCCC AGCTCCTGCT GAAGGCCACC 1140 

CCAGAGGACA ACGGGCGCAG CTTCTCCTGC TCTGCAACCC TGGAGGTGGC CGGCCAGCTT 1200 

ATACACAAGA ACCAGACCCG GGAGCTTCGT GTCCTGTATG GCCCCCGACT GGACGAGAGG 1260 

GATTGTCCGG GAAACTGGAC GTGGCCAGAA AATTCCCAGC AGACTCCAAT GTGCCAGGCT 1320 

TGGGGGAACC CATTGCCCGA GCTCAAGTGT CTAAAGGATG GCACTTTCCC ACTGCCCATC 1380 

GGGGAATCAG TGACTGTCAC TCGAGATCTT GAGGGCACCT ACCTCTGTCG GGCCAGGAGC 1440 

ACTCAAGGGG AGGTCACCCG CGAGGTGACC GTGAATGTGC TCTCCCCCCG GTATGAGATT 1500 

GTCATCATCA CTGTGGTAGC AGCCGCAGTC ATAATGGGCA CTGCAGGCCT CAGCACGTAC 1560 

CTCTATAACC GCCAGCGGAA GATCAAGAAA TACAGACTAC AACAGGCCCA AAAAGGGACC 1620 

CCCATGAAAC CGAACACACA AGCCACGCCT CCCTGAACCT ATCCCGGGAC AGGGCCTCTT 1680 

CCTCGGCCTT CCCATATTGG TGGCAGTGGT GCCACACTGA ACAGAGTGGA AGAGATATGC 1740 

CATGCAGCTA CACCTACCGG CCCTGGGACG CCGGAGGACA GGGCATTGTC CTCAGTCAGA 1800 

TACAACAGCA TTTGGGGCCA TGGTACCTGC ACACCTAAAA CACTAGGCCA CGCATCTGAT 1860 

CTGTAGTCAC ATGACTAAGC CAAGAGGAAG GAGCAAGACT CAAGACATGA TTGATGGATG 1920 

TTAAAGTCTA GCCTGATGAG AGGGGAAGTG GTGGGGGAGA CATAGCCCCA CCATGAGGAC 1980 

ATACAACTGG GAAATACTGA AACTTGCTGC CTATTGGGTA TGCTGAGGCC CACAGACTTA 2040 

CAGAAGAAGT GGCCCTCCAT AGACATGTGT AGCATCAAAA CACAAAGGCC CACACTTCCT 2100 

GACGGATGCC AGCTTGGGCA CTGCTGTCTA CTGACCCCAA CCCTTGATGA TATGTATTTA 2160 

TTCATTTGTT ATTTTACCAG CTATTTATTG AGTGTCTTTT ATGTAGGCTA AATGAACATA 2220 

GGTCTCTGGC CTCACGGAGC TCCCAGTCCA TGTCACATTC AAGGTCACCA GGTACAGTTG 2280 

TACAGGTTGT ACACTGCAGG AGAGTGCCTG GCAAAAAGAT CAAATGGGGC TGGGACTTCT 2340 

CATTGGCCAA CCTGCCTTTC CCCAGAAGGA GTGATTTTTC TATCGGCACA AAAGCACTAT 2400 

ATGGACTGGT AATGGTTCAC AGGTTCAGAG ATTACCCAGT GAGGCCTTAT TCCTCCCTTC 2460 

CCCCCAAAAC TGACACCTTT GTTAGCCACC TCCCCACCCA CATACATTTC TGCCAGTGTT 2520 

CACAATGACA CTCAGCGGTC ATGTCTGGAC ATGAGTGCCC AGGGAATATG CCCAAGCTAT 2SB0 

GCCTTGTCCT CTTGTCCTGT TTGCATTTCA CTGGGAGCTT GCACTATTGC AGCTCCAGTT 2640 

TCCTGCAGTG ATCAGGGTCC TGCAAGCAGT GGGGAAGGGG GCCAAGGTAT TGGAGGACTC 2700 

CCTCCCAGCT TTGGAAGGGT CATCCGCGTG TGTGTGTGTG TGTATGTGTA GACAAGCTCT 2760 

CGCTCTGTCA CCCAGGCTGG AGTGCAGTGG TGCAATCATG GTTCACTGCA GTCTTGACCT 2820 

TTTGGGCTCA AGTGATCCTC CCACCTCAGC CTCCTGAGTA GCTGGGACCA TAGGCTCACA 2880 

ACACCACACC TGGCAAATTT GATTTTTTTT TTTTTTTTCA GAGACGGGGT CTCGCAACAT 2940 
TGCCCAGACT TCCTTTGTGT TAGTTAATAA AGCTTTCTCA ACTGCC 
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MLQFVRAGAR AWLRPTGSQG LSSLAEEAAR ATENPEQVAS EGLPEPVLRK VELPVPTHRR 60 
PVQAWVESIiR GFEQERVGLA DLHPDVFATA PRLDILHQVA MWQKNFKRIS YAKTKTRAEV 120 
RGGGGKPLAA ERHWAGPAWQ HPLSALARRR CCPWPPGPTS YYYMLPMKVR ALGLKVALTV 180 
KLAQDDLHIM DSLELPTGDP QYLTELAHYR RWGDSVLLVD LTHEEMPQSI VEATSRLKTP 240 
NLIPAVGLNV HSMLKHQTIiV LTLPTVAFUS DKLLWQDSRY RPLYPFSLPY SDFPRPLPHA 300 
TQGPAATPYH C 

Seg ID NO: 142 DNA sequence 

Nucleic Acid Accession #: NM_000270.1 

Coding sequence: 110-979 (underlined sequences correspond to start and stop codons) 
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AACTGTGCGA ACCAGACCCG GCAGCCTTGC TCAGTTCAGC ATAGCGGAGC GGATCCGATC 60 

GGATCGGAGC ACACCGGAGC AGGCTCATCG AGAAGGCGTC TG CGAGACCAJE£GAGAACGG 120 

ATACACCTAT GAAGATTATA AGAACACTGC AGAATGGCTT CTGTCTCATA CTAAGCACCG 180 

ACCTCAAGTT GCAATAATCT GTGGTTCTGG ATTAGGAGGT CTGACTGATA AATTAACTCA 240 

GGCCCAGATC TTTGACTACA GTGAAATCCC CAACTTTCCT CGAAGTACAG TGCCAGGTCA 300 

TGCTGGCCGA CTGGTGTTTG GGTTCCTGAA TGGCAGGGCC TGTGTGATGA TGCAGGGCAG 360 

GTTCCACATG TATGAAGGGT ACCCACTCTG GAAGGTGACA TTCCCAGTGA GGGTTTTCCA 420 

CCTTCTGGGT GTGGACACCC TGGTAGTCAC CAATGCAGCA GGAGGGCTGA ACCCCAAGTT 4 80 

TGAGGTTGGA GATATCATGC TGATCCGTGA CCATATCAAC CTACCTGGTT TCAGTGGTCA 540 

GAACCCTCTC AGAGGGCCCA ATGATGAAAG GTTTGGAGAT CGTTTCCCTG CCATGTCTGA 600 

TGCCTACGAC CGGACTATGA GGCAGAGGGC TCTCAGTACC TGGAAACAAA TGGGGGAGCA 660 

ACGTGAGCTA CAGGAAGGCA CCTATGTGAT GGTGGCAGGC CCCAGCTTTG AGACTGTGGC 720 

AGAATGTCGT GTGCTGCAGA AGCTGGGAGC AGACGCTGTT GGCATGAGTA CAGTACCAGA 780 

AGTTATCGTT GCACGGCACT GTGGACTTCG AGTCTTTGGC TTCTCACTCA TCACTAACAA 840 

GGTCATCATG GATTATGAAA GCCTGGAGAA GGCCAACCAT GAAGAAGTCT TAGCAGCTGG 900 

CAAACAAGCT GCACAGAAAT TGGAACAGTT TGTCTCCATT CTTATGGCCA GCATTCCACT 960 

CCCTGACAAA GCCAG TTGA C CTGCCTTGGA GTCGTCTGGC ATCTCCCACA CAAGACCCAA 1020 

GTAGCTGCTA CCTTCTTTGG CCCCTTGCTG GAGTCATGTG CCTCTGTCCT TAGGTTGTAG 1080 

CAGAAAGGAA AAGATTCCTG TCCTTCACCT TTCCCACTTT CTTCTACCAG ACCCTTCTGG 1140 

TGCCAGATCC TCTTCTCAAA GCTGGGATTA CAGGTGTGAG CATAGTGAGA CCTTGGCGCT 1200 

ACAAAATAAA GCTGTTCTCA TTCCTGTTCT TTCTTACACA AGAGCTGGAG CCCGTGCCCT 1260 

ACCACACATC TGTGGAGATG CCCAGGATTT GACTCGGGCC TTAGAACTTT GCATAGCAGC 1320 

TGCTACTAGC TCTTTGAGAT AATACATTCC GAGGGGCTCA GTTCTGCCTT ATCTAAATCA 1380 
CCAGAGACCA AACAAGGACT AATCCAATAC CTCTTGGA 

Seq ID No: 143 Protein sequence: 
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MENGYTYEDY KNTAEWLLSH TKHRPQVAII CGSGLGGLTD KLTQAQIFDY SEIPNFPRST 
VPGHAGRLVF GFLNGRACVM MQGRFHMYEG YPLWKVTFPV RVFHLLGVDT LWTNAAGGL 
NPKFEVGDIM LIRDHINLPG FSGQNPLRGP NDERFGDRFP AMSDAYDRTM RQRALSTWKQ 
MGEQRELQEG TYVMVAGPSF ETVAECRVLQ KLGADAVGMS TVPEVIVARH CGLRVFGFSI* 
ITNKVIMDYE SLEKANHEEV LAAGKQAAQK LEQFVSILMA SIPLPDKAS 

Seq ID NO: 144 DNA sequence 
Nucleic Acid Accession #: NM_015577.1 
Coding sequence: 112-3054 (underlined sequences correspond to start and stop codons) 
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GAAGCGGCGG GCGGGGTGGA GCAGCCAGCT GGGTCCGGGG AGCGCCGCCG CCGCCTCGAT 60 

GGGGTGTTGA AAAGTCTCCT CTAGAGCTTT GGAAGGCTGA ATGCACTAAA CATGA AGAGC 120 

TTGAAAGCGA AGTTCAGGAA GAGTGACACC AATGAGTGGA ACAAGAATGA TGACCGGCTA 180 

CTGCAGGCCG TGGAGAATGG AGATGCGGAG AAGGTGGCCT CACTGCTCGG CAAGAAGGGG 240 

GCCAGTGCCA CCAAACACGA CAGTGAGGGC AAGACCGCTT TCCATCTTGC TGCTGCAAAA 300 

GGACACGTGG AATGCCTCAG GGTCATGATT ACACATGGTG TGGATGTGAC AGCCCAAGAT 360 

ACTACCGGAC ACAGCGCCTT ACATCTCGCA GCCAAGAACA GCCACCATGA ATGCATCAGG 420 

AGGCTGCTTC AGTCTAAATG CCCAGCCGAA AGTGTCGACA GCTCTGGGAA AAC AG CTTTA 480 

CATTATGCAG CGGCTCAGGG CTGCCTTCAA GCTGTGCAGA TTCTCTGCGA ACACAAGAGC 540 

CCCATAAACC TCAAAGATTT GGATGGGAAT ATACCGCTGC TTCTTGCTGT ACAAAATGGT 600 

CACAGTGAGA TCTGTCACTT TCTCCTGGAT CATGGAGCAG ATGTCAATTC CAGGAACAAA 660 

AGTGGAAGAA CTGCTCTCAT GCTGGCCTGT GAGATTGGCA GCTCTAACGC TGTGGAAGCC 720 

TTAATTAAAA AGGGTGCAGA CCTAAACCTT GTAGATTCTC TTGGATACAA TGCCTTACAT 780 

TATTCCAAAC TCTCAGAAAA TGCAGGAATT CAAAGCCTTC TATTATCAAA AATCTCTCAG 840 

GATGCTGATT TAAAGACCCC AACAAAACCA AAGCAGCATG ACCAAGTCTC TAAAATAAGC 900 
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TCAGAAAGAA GTGGAACTCC AAAAACACGC AAAGCTCCAC CACCTCCTAT CAGTCCTACC 960 

CAGTTGAGTG ATGTCTCTTC CCCAAGATCA ATAACTTCGA CTCCACTATC GGGAAAGGAA 1020 

TCGGTATTTT TTGCTGAACC ACCCTTCAAG GCTGAGATCA GTTCTATACG AGAAAACAAA 1080 

GACAGACTAA GTGACAGTAC TACAGGTGCT GATAGCTTAT TGGATATAAG TTCTGAAGCT 1140 

GACCAACAAG ATCTTCTCTC TCTATTGCAA GCAAAAGTTG CTTCCCTTAC CTTACACAAT 1200 

AAGGAGTTAC AAGATAAATT ACAGGCCAAA TCACCCAAGG AGGCGGAAGC AGACCTAAGC 1260 

TTTGACTCAT ACCATTCCAC CCAAACTGAC TTGGGCCCAT CCCTGGGAAA ACCTGGTGAA 1320 

ACCTCTCCCC CAGACTCCAA ATCATCTCCA TCTGTCTTAA TACATTCTTT AGGTAAATCC 1380 

ACTACTGACA ATGATGTCAG AATTCAGCAA CTGCAAGAGA TTTTGCAAGA TCTACAGAAG 1440 

AGATTAGAGA GCTCTGAAGC AGAGAGAAAA CAGCTACAGG TCGAACTCCA ATCCCGAAGG 1500 

GCAGAACTGG TATGCTTAAA CAACACTGAG ATTTCAGAGA ACAGCTCTGA CCTCAGCCAG 1560 

AAACTTAAAG AAACTCAGAG CAAATACGAG GAGGCTATGA AAGAAGTCCT TAGTGTGCAG 1620 

AAGCAGATGA AACTCGGTCT TGTCTCACCT GAAAGCATGG ATAATTATTC ACATTTCCAC 1680 

GAGCTGAGGG TCACGGAAGA GGAAATAAAT GTGCTAAAGC AGGATCTGCA GAATGCATTA 1740 

GAAGAAAGTG AAAGAAATAA AGAGAAAGTG AGAGAGTTAG AGGAAAAACT GGTAGAGAGG 1800 

GAGAAAGGTA CAGTGATTAA GCCACCTGTG GAAGAGTACG AGGAAATGAA AAGTTCATAT 1860 

TGCTCTGTTA TTGAGAATAT GAATAAGGAG AAAGCATTTT TGTTTGAGAA ATACCAAGAA 1920 

GCCCAAGAAG AAATCATGAA ATTAAAAGAC ACACTAAAAA GTCAGATGAC ACAGGAAGCC 1980 

AGTGATGAAG CTGAGGACAT GAAAGAAGCC ATGAATAGGA TGATAGATGA ACTCAATAAA 2040 

CAGGTGAGCG AGCTGTCACA GCTGTACAAA GAAGCCCAGG CTGAGCTGGA GGATTACAGG 2100 

AAGAGGAAAT CTCTAGAGGA TGTCACAGCT GAATATATCC ATAAAGCAGA GCATGAGAAA 2160 

CTGATGCAAT TGACAAACGT GTCCAGGGCT AAAGCAGAAG ATGCACTGTC TGAAATGAAG 2220 

TCTCAGTATT CAAAAGTGTT GAATGAGTTG ACCCAGCTCA AACAACTGGT GGATGCACAA 2280 

AAAGAGAACT CTGTCTCTAT CACAGAACAT TTGCAAGTGA TAACCACGCT GCGGACTGCA 2340 

GCAAAAGAGA TGGAAGAAAA AATAAGCAAT CTTAAGGAAC ACCTTGCAAG CAAGGAAGTG 2400 

GAAGTAGCAA AGCTGGAGAA ACAACTCTTA GAAGAGAAAG CTGCTATGAC TGATGCAATG 2460 

GTACCTCGGT CTTCCTATGA AAAACTCCAG TCATCCTTAG AGAGTGAAGT GAGTGTGTTG 2520 

GCATCGAAAT TAAAGGAATC TGTGAAAGAG AAAGAGAAGG TCCATTCAGA GGTTGTCCAG 2580 

ATTAGAAGTG AGGTCTCACA GGTGAAAAGA GAAAAGGAAA ATATTCAGAC TCTCTTGAAA 2640 

TCCAAAGAGC AAGAAGTAAA TGAACTTCTG CAAAAATTCC AGCAAGCTCA GGAAGAACTT 2700 

GCAGAAATGA AAAGATACGC TGAGAGCTCT TCAAAACTGG AGGAAGATAA AGATAAAAAG 2760 

ATAAATGAGA TGTCGAAGGA AGTCACCAAA TTGAAGGAGG CCTTGAACAG CCTCTCCCAG 2820 

CTCTCCTACT CAACAAGCTC ATCCAAAAGG CAGAGTCAGC AGCTGGAGGC GCTGCAGCAG 2880 

CAAGTCAAAC AGCTCCAGAA CCAGCTGGCG GAATGCAAGA AACAACACCA GGAGGTCATA 2940 

TCAGTTTACA GAATGCATCT TCTGTATGCT GTGCAGGGCC AGATGGATGA AGATGTCCAG 3000 

AAAGTACTGA AGCAAATCCT TACCATGTGT AAAAACCAGT CTCAAAAGAA GTAAAGTGGA 3060 

TTCCTTGGCA GGACACTGCC CCTTGTCATC TGTCTTTGTG TTAGATCCAG AGTTGTCGGC 3120 

AGCCGCTGCC ATTGTTCTCA TTCGTGGTAT GCACTGTGGC CTAGCGTAGC TTCTTCCCTT 3180 

TCCAAAGGTT TCTGAGGACT TCTCCCAGGA GAAGACTGCC CGCCTCAGAA CTGCTTAGAG 3240 

ACTTCAAACC AGCAGAGGTG AAAGTCCCTG TCATCCCTTC AGATTCCAGA GCTGGGATCA 3300 

GCCATGCCCA GAGGTCTGGT CCTGATGCTG GCAGGGGGGC CCCCTCCTCC ATCCCTGACT 3360 

GGCTGAGTGG CTTTATCACC ACCGAGTGAT GTGCTGAGGC CTCCTGCAGT GAATGCTCCT 3420 

TCCATTCCTG TACTCGGGCA GTGCCATTCA GCACAGGAGA GCTCTTTTTG CCTTTGGCTT 3480 

TCAATTCCAA AACATGATTT AATTTCTAAC TAAATTAGTA TGGCACTAGT TATGAAGTAT 3540 

CTGCTTAAAA CCCTTCATCA TGATATCCTG TGGATTTAAA AACTCTAATT CCATGTTTTC 3600 

TTCCCATCTG CCTTATATAT CTCATCACCC TGCTTATCAA TATTCAGTTT GATGAGCACT '3660 

ATTAACTAAA ATATGAAACT TAAAAACAAA AGCAAGTTGT CCTTAAAAGT TCTTTTTTTA 3720 

AGTAAATTGT TGACATACTG CAAATTTTCT ATGCAAACTT GCCTCCTGCT GTTATCTGTG 3780 

AAGCTCAGGA AATCCAAACA TTTGTGTTTC AACAAGGGAC AGTAAACTGT GTGTTTACAG 3840 

CCAAAAGAAA TGCCTCATAG TTCTTAACCT CAACTTTTGT AGAAGTATTT TTTTCTCTGT 3900 

AATATTTTTA TTGGCTCATA AAGATGTTTT CATATCTGAA CTCCTAAATA AGTGAAATTA 3960 

CAGTAGATTA TATTAACAAA ATACTTTTTA GGTAGCCATG CTTGAGACTT TTTAAAAATA 4020 

TAACTTTTTC CTTAAAGTTT TCAGCTATAG CAAAAGGTAG TTATGTATGC CAGACCTAAT 4080 

ATGAGCTGCC ACCAACACCC CTAGAACTTT CAGCCATGGT GTCTTCAGAA TTGTAGCGCA 4140 

TTTCTGAATC TAGCAAATCC TCCTTTTACC CGTTGAATGT TTTGAATGCC CTGACTCTAC 4200 

CAGCGCCCAT AAATGATCTC TAGAAGGACT GTTAGTACCA ATCTGTTTTT CAACTTTGAA 4260 

GCTAAAAACC CTGATATGGT AATATTATGG TGCATAGCAG AGGTCTCGGA AAAAAAATAT 4320 

TTCTGTTCAC TTTACTTTCA GGTTAAAAAT GTTTCTAACA CGCTTGCAAC TTCCCTTATG 4380 

GCATTAATCT TGTTGAGGGA GAGAGACAGA ATCCTGGACT CTCCAAAGTA TTTAACTGAA 4440 

AGTAGGGCCT GCTCTGACAG GGCCCATGTC CCACAAGGCT GCTTGGCCTC AGTGGGTGCT 4500 

TGGCTGTGCT GGATGATATG TTGATCTGTA TTGGATAAGG ACCAATGACA GCAAAGCAAA 4560 

AATGGCTTTA AAGCTTGGTG TTACTTTTCT TAAGTTGTTT AATTATAGTT AAGCAATTTC 4620 

AAAAATGCTC CAAAGAAATG TGAAAGGACC TTTTGTCACA GCACTTCAGA AAATACACAA 4680 

CAGCCCCTTC TGCCCCCGCA CAGAAATGCT GCAGAGTATA TAAAACTTGA GACATTTTTG 4740 

TAGGATGCCT GACGAGGTGT AGCCTTTTAT CTTGTTTCCG GATGCATATT TATTACGAGT 4800 

ACTCTGGTTA AATATTGAAA AGTTATATGC TGTAGTTTTT AGTATTTTGT CTTTGTAATT 4860 

TACAGAAGTT ATTGGAGAAA ATAAACTTGT TTCATTTTGC AAAAAAAAAA AAAAAAAAAA 4920 
AAAAA 
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MKSLKAKFRX SDTNEWNKND DRIiLQAVENG DAEKVASLLG KKGASATKHD SEGKTAFHLA 60 
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AAKGHVECLR 
TALHYAAAQG 
RNKSGRTALM 
ISQDADLKTP 
GKESVFFAEP 
LHNKELQDKL 
GKSTTDNDVR 
LSQKLKETQS 
NALEESERNK 
YQEAQEEIMK 
DYRKRKSLED 
DAQKENSVSI 
DAMVPRSSYE 
LLKSKEQEVN 
LSQLSYSTSS 
DVQKVLKQIL 



VMITHGVDVT 
CLQAVQILCE 
LACEIGSSNA 
TKPKQHDQVS 
PFKAEISSIR 
QAKS PKEAEA 
IQQLQEILQD 
KYEEAMKEVL 
EKVRELEEKL 
LKDTLKSQMT 
VTAEYIHKAE 
TEHLQVITTL 
KLQSSLESEV 
ELLQKFQQAQ 
SKRQSQQLEA 
TMCKNQSQKK 



AQDTTGHSAL 
HKSPINLKDL 
VEALIKKGAD 
KISSERSGTP 
ENKDRLSDST 
DLSFDSYHST 
LQKRLESSEA 
SVQKQMKIjGL 
VEREKGTVIK 
QEASDEAEDM 
HEKLMQLTNV 
RTAAKEMEEK 
SVIiASKLKES 
EEIAEMKRYA 
LQQQVKQLQN 



HLAAKNSHHE 
DGNIPLLLAV 
LNLVDSLGYN 
KTRKAPPPPI 
TGADSLLDIS 
QTDLGPSLGK 
ERKQLQVELQ 
VSPESMDNYS 
PPVEEYEEMK 
KEAMNRMIDE 
SRAKAEDALS 
ISNLKEHLAS 
VKEKEKVHSE 
ESSSKLEEDK 
QLAECKKQHQ 



CIRRLLQSKC 
QNGHSEICHF 
ALHYSKLSEN 
SPTQLSDVSS 
SEADQQDLLS 
PGETSPPDSK 
SRRAELVCLN 
HFHELRVTEE 
SSYCSVIENM 
LNKQVSELSQ 
BMKSQYSKYIi 
KEVEVAKLEK 
WQIRSEVSQ 
DKKINEMSKB 
EVISVYRMHL 



PAESVDSSGK 
LLDHGADVNS 
AGIQSLLLSK 
PRSITSTPLS 
LLQAKVASLT 
SSPSVLIHSIi 
NTEISENSSD 
EINVLKQDLQ 
NKEKAFLPEK 
LYKEAQAELE 
NELTQLKQLV 
QLLEEKAAMT 
VKREKENIQT 
VTKLKEALNS 
LYAVQGQMDB 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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25 CTTCTGTGCT GTTCCTTCTT GCCTCTAACT TGTAAACAAG ACGTACTAGG ACGATGCTAA 60 

TGGAAAGTCA CAAACCGCTG GGTTTTTGAA AGGATCCTTG GGACCTCATG CACATTTGTG 120 

GAAACTGGAT GGAGAGATTT GGGGAAGC AT GG ACTCTTTA GCCAGCTTAG TTCTCTGTGG 180 

AGTCAGCTTG CTCCTTTCTG GAACTGTGGA AGGTGCCATG GACTTGATCT TGATCAATTC 240 

CCTACCTCTT GTATCTGATG CTGAAACATC TCTCACCTGC ATTGCCTCTG GGTGGCGCCC 300 

30 CCATGAGCCC ATCACGATAG GAAGGGACTT TGAAGCCTTA ATGAACCAGC ACCAGGATCC 360 

GCTGGAAGTT ACTCAAGATG TGACCAGAGA ATGGGCTAAA AAAGTTGTTT GGAAGAGAGA 420 

AAAGGCTAGT AAGATCAATG GTGCTTATTT CTGTGAAGGG CGAGTTCGAG GAGAGGCAAT 480 

CAGGATACGA ACCATGAAGA TGCGTCAACA AGCTTCCTTC CTACCAGCTA CTTTAACTAT 540 

GACTGTGGAC AAGGGAGATA ACGTGAACAT ATCTTTCAAA AAGGTATTGA TTAAAGAAGA 600 

35 AGATGCAGTG ATTTACAAAA ATGGTTCCTT CATCCATTCA GTGCCCCGGC ATGAAGTACC 660 

TGATATTCTA GAAGTACACC TGCCTCATGC TCAGCCCCAG GATGCTGGAG TGTACTCGGC 720 

CAGGTATATA GGAGGAAACC TCTTCACCTC GGCCTTCACC AGGCTGATAG TCCGGAGATG 780 

TGAAGCCCAG AAGTGGGGAC CTGAATGCAA CCATCTCTGT ACTGCTTGTA TGAACAATGG 840 

TGTCTGCCAT GAAGATACTG GAGAATGCAT TTGCCCTCCT GGGTTTATGG GAAGGACGTG 900 

40 TGAGAAGGCT TGTGAACTGC ACACGTTTGG CAGAACTTGT AAAGAAAGGT GCAGTGGACA 960 

AGAGGGATGC AAGTCTTATG TGTTCTGTCT CCCTGACCCC TATGGGTGTT CCTGTGCCAC 1020 

AGGCTGGAAG GGTCTGCAGT GCAATGAAGC ATGCCACCCT GGTTTTTACG GGCCAGATTG 1080 

TAAGCTTAGG TGCAGCTGCA ACAATGGGGA GATGTGTGAT CGCTTCCAAG GATGTCTCTG 1140 

CTCTCCAGGA TGGCAGGGGC TCCAGTGTGA GAGAGAAGGC ATACCGAGGA TGACCCCAAA 1200 

45 GATAGTGGAT TTGCCAGATC ATATAGAAGT AAACAGTGGT AAATTTAATC CCATTTGCAA 1260 

AGCTTCTGGC TGGCCGCTAC CTACTAATGA AGAAATGACC CTGGTGAAGC CGGATGGGAC 1320 

AGTGCTCCAT CCAAAAGACT TTAACCATAC GGATCATTTC TCAGTAGCCA TATTCACCAT 1380 

CCACCGGATC CTCCCCCCTG ACTCAGGAGT TTGGGTCTGC AGTGTGAACA CAGTGGCTGG 1440 

GATGGTGGAA AAGCCCTTCA ACATTTCTGT TAAAGTTCTT CCAAAGCCCC TGAATGCCCC 1500 

50 AAACGTGATT GACACTGGAC ATAACTTTGC TGTCATCAAC ATCAGCTCTG AGCCTTACTT 1560 

TGGGGATGGA CCAATCAAAT CCAAGAAGCT TCTATACAAA CCCGTTAATC ACTATGAGGC 1620 

TTGGCAACAT ATTCAAGTGA CAAATGAGAT TGTTACACTC AACTATTTGG AACCTCGGAC 1680 

AGAATATGAA CTCTGTGTGC AACTGGTCCG TCGTGGAGAG GGTGGGGAAG GGCATCCTGG 1740 

ACCTGTGAGA CGCTTCACAA CAGCTTCTAT CGGACTCCCT CCTCCAAGAG GTCTAAATCT 1800 

55 CCTGCCTAAA AGTCAGACCA CTCTAAATTT GACCTGGCAA CCAATATTTC CAAGCTCGGA 1860 

AGATGACTTT TATGTTGAAG TGGAGAGAAG GTCTGTGCAA AAAAGTGATC AGCAGAATAT 1920 

TAAAGTTCCA GGCAACTTGA CTTCGGTGCT ACTTAACAAC TTACATCCCA GGGAGCAGTA 1980 

CGTGGTCCGA GCTAGAGTCA ACACCAAGGC CCAGGGGGAA TGGAGTGAAG ATCTCACTGC 2040 

TTGGACCCTT AGTGACATTC TTCCTCCTCA ACCAGAAAAC ATCAAGATTT CCAACATTAC 2100 

60 ACACTCCTCG GCTGTGATTT CTTGGACAAT ATTGGATGGC TATTCTATTT CTTCTATTAC 2160 

TATCCGTTAC AAGGTTCAAG GCAAGAATGA AGACCAGCAC GTTGATGTGA AGATAAAGAA 2220 

TGCCACCATC ATTCAGTATC AGCTCAAGGG CCTAGAGCCT GAAACAGCAT ACCAGGTGGA 2280 

CATTTTTGCA GAGAACAACA TAGGGTCAAG CAACCCAGCC TTTTCTCATG AACTGGTGAC 2340 

CCTCCCAGAA TCTCAAGCAC CAGCGGACCT CGGAGGGGGG AAGATGCTGC TTATAGCCAT 2400 

65 CCTTGGCTCT' GCTGGAATGA CCTGCCTGAC TGTGCTGTTG GCCTTTCTGA TCATATTGCA 2460 

ATTGAAGAGG GCAAATGTGC AAAGGAGAAT GGCCCAAGCC TTCCAAAACG TGAGGGAAGA 2520 

ACCAGCTGTG CAGTTCAACT CAGGGACTCT GGCCCTAAAC AGGAAGGTCA AAAACAACCC 2580 

AGATCCTACA ATTTATCCAG TGCTTGACTG GAATGACATC AAATTTCAAG ATGTGATTGG 2640 

GGAGGGCAAT TTTGGCCAAG TTCTTAAGGC GCGCATCAAG AAGGATGGGT TACGGATGGA 2700 

70 TGCTGCCATC AAAAGAATGA AAGAATATGC CTCCAAAGAT GATCACAGGG ACTTTGCAGG 2760 

AGAACTGGAA GTTCTTTGTA AACTTGGACA CCATCCAAAC ATCATCAATC TCTTAGGAGC 2820 

ATGTGAACAT CGAGGCTACT TGTACCTGGC CATTGAGTAC GCGCCCCATG GAAACCTTCT 2880 

GGACTTCCTT CGCAAGAGCC GTGTGCTGGA GACGGACCCA GCATTTGCCA TTGCCAATAG 2940 

CACCGCGTCC ACACTGTCCT CCCAGCAGCT CCTTCACTTC GCTGCCGACG TGGCCCGGGG 3000 

75 CATGGACTAC TTGAGCCAAA AACAGTTTAT CCACAGGGAT CTGGCTGCCA GAAACATTTT 3060 

AGTTGGTGAA AACTATGTGG CAAAAATAGC AGATTTTGGA TTGTCCCGAG GTCAAGAGGT 3120 
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GTACGTGAAA AAGACAATGG GAAGGCTCCC AGTGCGCTGG ATGGCCATCG AGTCACTGAA 3180 

TTACAGTGTG TACACAACCA ACAGTGATGT ATGGTCCTAT GGTGTGTTAC TATGGGAGAT 3240 

TGTTAGCTTA GGAGGCACAC CCTACTGCGG GATGACTTGT GCAGAACTCT ACGAGAAGCT 3300 

GCCCCAGGGC TACAGACTGG AGAAGCCCCT GAACTGTGAT GATGAGGTGT ATGATCTAAT 3360 

GAGACAATGC TGGCGGGAGA AGCCTTATGA GAGGCCATCA TTTGCCCAGA TATTGGTGTC 3420 

CTTAAACAGA ATGTTAGAGG AGCGAAAGAC CTACGTGAAT ACCACGCTTT ATGAGAAGTT 3480 

TACTTATGCA GGAATTGACT GTTCTGCTGA AGAAGCGGCC TAGG ACAQAA CATCTGTATA 3540 

CCCTCTGTTT CCCTTTCACT GGCATGGGAG ACCCTTGACA ACTGCTGAGA AAACATGCCT 3600 

CTGCCAAAGG ATGTGATATA TAAGTGTACA TATGTGCTGG AATTCTAACA AGTCATAGGT 3660 

TAATATTTAA GACACTGAAA AATCTAAGTG ATATAAATCA GATTCTTCTC TCTCATTTTA 3720 

TCCCTCACCT GTAGCATGCC AGTCCCGTTT CATTTAGTCA TGTGACCACT CTGTCTTGTG 3780 

TTTCCACAGC CTGCAAGTTC AGTCCAGGAT GCTAACATCT AAAAATAGAC TTAAATCTCA 3840 

TTGCTTACAA GCCTAAGAAT CTTTAGAGAA GTATACATAA GTTTAGGATA AAATAATGGG 3900 

ATTTTCTTTT CTTTTCTCTG GTAATATTGA CTTGTATATT TTAAGAAATA ACAGAAAGCC 3960 

TGGGTGACAT TTGGGAGACA TGTGACATTT ATATATTGAA TTAATATCCC TACATGTATT 4020 

GCACATTGTA AAAAGTTTTA GTTTTGATGA GTTGTGAGTT TACCTTGTAT ACTGTAGGCA 4080 
CACTTTGCAC TGATATATCA TGAGTGAATA AATGTCTTGC CTACTCAAAA AAAAAAAA 

Seq ID No: 147 Protein sequence: 
Protein Accession #: NP — 000450.1 



1 
I 

MDSIiASLVLC 
FEALMNQHQD 
QASPLPATLT 
AQPODAGVYS 
ICPPGFMGRT 
ACHPGFYGPD 
VNSGKFNPIC 
VWVCSVNTVA 
LLYKPVNKYE 
IGLPPPRGLN 
LLNNLHPREQ 
ILDGYSISSI 
SNPAFSHELV 
MAQAFQNVRE 
ARIKKDGLRM 
AIEYAPHGNL 
IHRDLAARNI 
VWSYGVLLWE 
ERPSFAQILV 



11 
I 

GVSLLLSGTV 
PLEVTQDVTR 
MTVDKGDNVN 
ARYIGGNLFT 
CEKACELHTF 
CKLRCSCNNG 
KASGWPLPTN 
GMVEKPFNIS 
AWQHTQVTNE 
LLPKSQTTLN 
YWRARVNTK 
TIRYKVQGKN 
TLPESQAPAD 
EPAVQFNSGT 
DAAIKRMKEY 
LDFLRKSRVL 
LVGENYVAKI 
IVSLGGTPYC 
SLNRMIiEERK 



21 
I 

EGAMDLILIN 
EWAKKWWKR 
ISFKKVLIKE 
SAFTRLIVRR 
GRTCKERCSG 
EMCDRFQGCL 
EEMTLVKPDG 
VKVLPKPLNA 
IVTLNYLEPR 
LTWQPIFPSS 
AQGEWSEDLT 
EDQHVDVKIK 
LGGGKMLLIA 
LALNRKVKNN 
ASKDDHRDFA 
ETDPAFAIAN 
ADFGLSRGQE 
GMTCAELYEK 
TYVNTTLYEK 



31 

I 

SLPLVSDAET 
EKASKINGAY 
EDAVIYKNGS 
CEAQKWGPEC 
QEGCKSYVFC 
CSPGWQGLQC 
TVLHPKDFNH 
PNVIDTGHNF 
TEYELCVQLV 
EDDFYVEVER 
AWTLSDILPP 
NATIIQYQLK 
ILGSAGMTCL 
PDPTIYPVLD 
GEIiEVLCKLG 
STASTLSSQQ 
VYVKKTMGRL 
LPQGYRLEKP 
FTYAGIDCSA 



41 

I 

SLTCIASGWR 
FCEGRVRGEA 
FIHSVPRHEV 
NHLCTACMNN 
LPDPYGCSCA 
EREGIPRMTP 
TDHFSVA1FT 
AVINISSEPY 
RRGEGGEGHP 
RSVQKSDQQN 
QPENIKISNI 
GLEPETAYQV 
TVLLAFLIIL 
WNDIKFQDVI 
HHPNIINLLG 
LLHFAADVAR 
PVRWMA1ESL 
LNCDDEVYDL 
EEAA 



51 
I 

PHEPITIGRD 
IRIRTMKMRQ 
PDILEVHLPH 
GVCHEDTGEC 
TGWKGLQCNE 
KIVDLPDHIE 
IHRILPPDSG 
FGDGPIKSKK 
GPVRRFTTAS 
IKVPGNLTSV 
THSSAVISWT 
DIFAENNIGS 
QLKRANVQRR 
GEGNFGQVLK 
ACEHRGYLYL 
GMDYLSQKQF 
NYSVYTTNSD 
MRQCWREKPY 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NO: 148 DNA sequence ~ 
Nucleic Acid Accession #: NMJJ00552.2 

Coding sequence: 311-8752 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



AGCTCACAGC 
TATCTCCCCC 
GAGCTGTAGC 
GCACCATTGT 
GCAGGGGAAG 
GCCCTCATTT 
GCCAGGGACC 
TTTCGGAAGT 
CAGTTACCTC 
GAATGGCAAG 
TGTCAATGGT 
GCTGTATCTA 
GGCCAGGATC 
GACCTGCGGG 
AGGGACCTTG 
ACAGTGGTGT 
GCAGAAGGGC 
CCACCCTCTG 
TGCTGGGGGG 
GGAGGGAATG 
TGGTATGGAG 
CAATGAAATG 
GGATGAAGGC 
CCCTCCCGGC 
GATCTGCAGC 



TATTGTGGTG 
AGCAGTGGGG 
AGACCTGATT 
CCAGCAGCTG 
GCACCATTGT 
AlgATTCCTG 
CTTTGTGCAG 
GACTTCGTCA 
CTGGCAGGGG 
AGAGTGAGCC 
ACCGTGACAC 
GAAACTGAGG 
GATGGCAGCG 
CTGTGTGGCA 
ACCTCGGACC 
GAACGGGCAT 
CTGTGGGAGC 
GTGGACCCCG 
CTGGAGTGCG 
GTGCTGTACG 
TATAGGCAGT 
TGTCAGGAGC 
CTCTGCGTGG 
ACCTCCCTCT 
AATGAAGAAT 



GGAAAGGGAG 
ACTCCACAGC 
GAGCCTTTGC 
AGTTTCCCAG 
CCAGCAGCTG 
CCAGATTTGC 
AAGGAACTCG 
ACACCTTTGA 
GCTGCCAGAA 
TCTCCGTGTA 
AGGGGGACCA 
CTGGGTACTA 
GCAACTTTCA 
ACTTTAACAT 
CTTATGACTT 
CTCCTCCCAG 
AGTGCCAGCT 
AGCCTTTTGT 
CCTGCCCTGC 
GCTGGACCGA 
GTGTGTCCCC 
GATGCGTGGA 
AGAGCACCGA 
CTCGAGACTG 
GTCCAGGGGA 



GGTGGTTGGT 
CCCTGGGCTA 
AGCAGCTGAG 
GGACCTTGGA 
AGTTTCCCAG 
CGGGGTGCTG 
CGGCAGGTCA 
TGGGAGCATG 
ACGCTCCTTC 
TCTTGGGGAA 
AAGAGTCTCC 
CAAGCTGTCC 
AGTCCTGCTG 
CTTTGCTGAA 
TGCCAACTCA 
CAGCTCATGC 
TCTGAAGAGC 
GGCCCTGTGT 
CCTCCTGGAG 
CCACAGCGCG 
TTGCGCCAGG 
TGGCTGCAGC 
GTGTCCCTGC 
CAACACCTGC 
GTGCCTTGTC 



GGATGTCACA 
CATAACAGCA 
AGCATGGCCT 
GATAGCCGCA 
GGACCTTGGA 
CTTGCTCTGG 
TCCACGGCCC 
TACAGCTTTG 
TCGATTATTG 
TTTTTTGACA 
ATGCCCTATG 
GGTGAGGCCT 
TCAGACAGAT 
GATGACTTTA 
TGGGCTCTGA 
AACATCTCCT 
ACCTCGGTGT 
GAGAAGACTT 
TACGCCCGGA 
TGCAGCCCAG 
ACCTGCCAGA 
TGCCCTGAGG 
GTGCATTCCG 
ATTTGCCGAA 
ACTGGTCAAT 



GCTTGGGCTT 

AGACAGTCCG 

AGGGTGGGCG 

GCCCTCATTT 

GATAGCCGCA 

CCCTCATTTT 

GATGCAGCCT 

CGGGATACTG 

GGGACTTCCA 

TCCATTTGTT 

CCTCCAAAGG 

ATGGCTTTGT 

ACTTCAACAA 

TGACCCAAGA 

GCAGTGGAGA ' 

CTGGGGAAAT 

TTGCCCGCTG 

TGTGTGAGTG 

CCTGTGCCCA 

TGTGCCCTGC 

GCCTGCACAT 

GACAGCTCCT 

GAAAGCGCTA 

ACAGCCAGTG 

CCCACTTCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



250 



WO 02/079492 



GAGCTTTGAC AACAGATACT TCACCTTCAG TGGGATCTGC CAGTACCTGC TGGCCCGGGA 1560 

TTGCCAGGAC CACTCCTTCT CCATTGTCAT TGAGACTGTC CAGTGTGCTG ATGACCGCGA 1620 

CGCTGTGTGC ACCCGCTCCG TCACCGTCCG GCTGCCTGGC CTGCACAACA GCCTTGTGAA 1680 

ACTGAAGCAT GGGGCAGGAG TTGCCATGGA TGGCCAGGAC ATCCAGCTCC CCCTCCTGAA 1740 

AGGTGACCTC CGCATCCAGC ATACAGTGAC GGCCTCCGTG CGCCTCAGCT ACGGGGAGGA 1800 

CCTGCAGATG GACTGGGATG GCCGCGGGAG GCTGCTGGTG AAGCTGTCCC CCGTCTACGC 1860 

CGGGAAGACC TGCGGCCTGT GTGGGAATTA CAATGGCAAC CAGGGCGACG ACTTCCTTAC 1920 

CCCCTCTGGG CTGGCAGAGC CCCGGGTGGA GGACTTCGGG AACGCCTGGA AGCTGCACGG 1980 

GGACTGCCAG GACCTGCAGA AGCAGCACAG CGATCCCTGC GCCCTCAACC CGCGCATGAC 2040 

CAGGTTCTCC GAGGAGGCGT GCGCGGTCCT GACGTCCCCC ACATTCGAGG CCTGCCATCG 2100 

TGCCGTCAGC CCGCTGCCCT ACCTGCGGAA CTGCCGCTAC GACGTGTGCT CCTGCTCGGA 2160 

CGGCCGCGAG TGCCTGTGCG GCGCCCTGGC CAGCTATGCC GCGGCCTGCG CGGGGAGAGG 2220 

CGTGCGCGTC GCGTGGCGCG AGCCAGGCCG CTGTGAGCTG AACTGCCCGA AAGGCCAGGT 2280 

GTACCTGCAG TGCGGGACCC CCTGCAACCT GACCTGCCGC TCTCTCTCTT ACCCGGATGA 2340 

GGAATGCAAT GAGGCCTGCC TGGAGGGCTG CTTCTGCCCC CCAGGGCTCT ACATGGATGA 2400 

GAGGGGGGAC TGCGTGCCCA AGGCCCAGTG CCCCTGTTAC TATGACGGTG AGATCTTCCA 2460 

GCCAGAAGAC ATCTTCTCAG ACCATCACAC CATGTGCTAC TGTGAGGATG GCTTCATGCA 2520 

CTGTACCATG AGTGGAGTCC CCGGAAGCTT GCTGCCTGAC GCTGTCCTCA GCAGTCCCCT 2580 

GTCTCATCGC AGCAAAAGGA GCCTATCCTG TCGGCCCCCC ATGGTCAAGC TGGTGTGTCC 2640 

CGCTGACAAC CTGCGGGCTG AAGGGCTCGA GTGTACCAAA ACGTGCCAGA ACTATGACCT 2700 

GGAGTGCATG AGCATGGGCT GTGTCTCTGG CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

GGAGTGCATG AGCATGGGCT GTGTCTCTGG CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

TGAGAACAGA TGTGTGGCCC TGGAAAGGTG TCCCTGCTTC CATCAGGGCA AGGAGTATGC 2820 

CCCTGGAGAA ACAGTGAAGA TTGGCTGCAA CACTTGTGTC TGTCGGGACC GGAAGTGGAA 2880 

CTGCACAGAC CATGTGTGTG ATGCCACGTG CTCCACGATC GGCATGGCCC ACTACCTCAC 294 0 

CTTCGACGGG CTCAAATACC TGTTCCCCGG GGAGTGCCAG TACGTTCTGG TGCAGGATTA 3000 

CTGCGGCAGT AACCCTGGGA CCTTTCGGAT CCTAGTGGGG AATAAGGGAT GCAGCCACCC 3060 

CTCAGTGAAA TGCAAGAAAC GGGTCACCAT CCTGGTGGAG GGAGGAGAGA TTGAGCTGTT 3120 

TGACGGGGAG GTGAATGTGA AGAGGCCCAT GAAGGATGAG ACTCACTTTG AGGTGGTGGA 3180 

GTCTGGCCGG TACATCATTC TGCTGCTGGG CAAAGCCCTC TCCGTGGTCT GGGACCGCCA 324 0 

CCTGAGCATC TCCGTGGTCC TGAAGCAGAC ATACCAGGAG AAAGTGTGTG GCCTGTGTGG 3300 

GAATTTTGAT GGCATCCAGA ACAATGACCT CACCAGCAGC AACCTCCAAG TGGAGGAAGA 3360 

CCCTGTGGAC TTTGGGAACT CCTGGAAAGT GAGCTCGCAG TGTGCTGACA CCAGAAAAGT 3420 

GCCTCTGGAC TCATCCCCTG CCACCTGCCA TAACAACATC ATGAAGCAGA CGATGGTGGA 3480 

TTCCTCCTGT AGAATCCTTA CCAGTGACGT CTTCCAGGAC TGCAACAAGC TGGTGGACCC 354 0 

CGAGCCATAT CTGGATGTCT GCATTTACGA CACCTGCTCC TGTGAGTCCA TTGGGGACTG 3600 

CGCCTGCTTC TGCGACACCA TTGCTGCCTA TGCCCACGTG TGTGCCCAGC ATGGCAAGGT 3660 

GGTGACCTGG AGGACGGCCA CATTGTGCCC CCAGAGCTGC GAGGAGAGGA ATCTCCGGGA 3720 

GAACGGGTAT GAGTGTGAGT GGCGCTATAA CAGCTGTGCA CCTGCCTGTC AAGTCACGTG 3780 

TCAGCACCCT GAGCCACTGG CCTGCCCTGT GCAGTGTGTG GAGGGCTGCC ATGCCCACTG 3840 

CCCTCCAGGG AAAATCCTGG ATGAGCTTTT GCAGACCTGC GTTGACCCTG AAGACTGTCC 3900 

AGTGTGTGAG GTGGCTGGCC * GGCGTTTTGC CTCAGGAAAG AAAGTCACCT TGAATCCCAG 3960 

TGACCCTGAG CACTGCCAGA TTTGCCACTG TGATGTTGTC AACCTCACCT GTGAAGCCTG 4020 

CCAGGAGCCG GGAGGCCTGG TGGTGCCTCC CACAGATGCC CCGGTGAGCC CCACCACTCT 4080 

GTATGTGGAG GACATCTCGG AACCGCCGTT GCACGATTTC TACTG CAGCA GGCTACTGGA 4140 

CCTGGTCTTC CTGCTGGATG GCTCCTCCAG GCTGTCCGAG GCTGAGTTTG AAGTGCTGAA 4200 

GGCCTTTGTG GTGGACATGA TGGAGCGGCT GCGCATCTCC CAGAAGTGGG TCCGCGTGGC 4260 

CGTGGTGGAG TACCACGACG GCTCCCACGC CTACATCGGG CTCAAGGACC GGAAGCGACC 4320 

GTCAGAGCTG CGGCGCATTG CCAGCCAGGT GAAGTATGCG GGCAGCCAGG TGGCCTCCAC 4380 

CAGCGAGGTC TTGAAATACA CACTGTTCCA AATCTTCAGC AAGATCGACC GCCCTGAAGC 4440 

CTCCCGCATC GCCCTGCTCC TGATGGCCAG CCAGGAGCCC CAACGGATGT CCCGGAACTT 4500 

TGTCCGCTAC GTCCAGGGCC TGAAGAAGAA GAAGGTCATT GTGATCCCGG TGGGCATTGG 4560 

GCCCCATGCC AACCTCAAGC AGATCCGCCT CATCGAGAAG CAGGCCCCTG AGAACAAGGC 4620 

CTTCGTGCTG AGCAGTGTGG ATGAGCTGGA GCAGCAAAGG GACGAGATCG TTAGCTACCT 4680 

CTGTGACCTT GCCCCTGAAG CCCCTCCTCC TACTCTGCCC CCCCACATGG CACAAGTCAC 4740 

TGTGGGCCCG GGGCTCTTGG GGGTTTCGAC CCTGGGGCCC AAGAGGAACT CCATGGTTCT 4800 

GGATGTGGCG TTCGTCCTGG AAGGATCGGA CAAAATTGGT GAAGCCGACT TCAACAGGAG 4860 

CAAGGAGTTC ATGGAGGAGG TGATTCAGCG GATGGATGTG GGCCAGGACA GCATCCACGT 4920 

CACGGTGCTG CAGTACTCCT ACATGGTGAC CGTGGAGTAC CCCTTCAGCG AGGCACAGTC 4980 

CAAAGGGGAC ATCCTGCAGC GGGTGCGAGA GATCCGCTAC CAGGGCGGCA ACAGGACCAA 5040 

CACTGGGCTG GCCCTGCGGT ACCTCTCTGA CCACAGCTTC TTGGTCAGCC AGGGTGACCG 5100 

GGAGCAGGCG CCCAACCTGG TCTACATGGT CACCGGAAAT CCTGCCTCTG ATGAGATCAA 5160 

GAGGCTGCCT GGAGACATCC AGGTGGTGCC CATTGGAGTG GGCCCTAATG CCAACGTGCA 5220 

GGAGCTGGAG AGGATTGGCT GGCCCAATGC CCCTATCCTC ATCCAGGACT TTGAGACGCT 5280 

CCCCCGAGAG GCTCCTGACC TGGTGCTGCA GAGGTGCTGC TCCGGAGAGG GGCTGCAGAT 5340 

CCCCACCCTC TCCCCTGCAC CTGACTGCAG CCAGCCCCTG GACGTGATCC TTCTCCTGGA 5400 

TGGCTCCTCC AGTTTCCCAG CTTCTTATTT TGATGAAATG AAGAGTTTCG CCAAGGCTTT 5460 

CATTTCAAAA GCCAATATAG GGCCTCGTCT CACTCAGGTG TCAGTGCTGC AGTATGGAAG 5520 

CATCACCACC ATTGACGTGC CATGGAACGT GGTCCCGGAG AAAGCCCATT TGCTGAGCCT 5580 

TGTGGACGTC ATGCAGCGGG AGGGAGGCCC CAGCCAAATC GGGGATGCCT TGGGCTTTGC 5640 

TGTGCGATAC TTGACTTCAG AAATGCATGG TGCCAGGCCG GGAGCCTCAA AGGCGGTGGT 5700 

CATCCTGGTC ACGGACGTCT CTGTGGATTC AGTGGATGCA GCAGCTGATG CCGCCAGGTC 5760 

CAACAGAGTG ACAGTGTTCC CTATTGGAAT TGGAGATCGC TACGATGCAG CCCAGCTACG 5820 

GATCTTGGCA GGCCCAGCAG GCGACTCCAA CGTGGTGAAG CTCCAGCGAA TCGAAGACCT 5880 

CCCTACCATG GTCACCTTGG GCAATTCCTT CCTCCACAAA CTGTGCTCTG GATTTGTTAG 5940 

GATTTGCATG GATGAGGATG GGAATGAGAA GAGGCCCGGG GACGTCTGGA CCTTGCCAGA 6000 
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CCAGTGCCAC ACCGTGACTT GCCAGCCAGA TGGCCAGACC TTGCTGAAGA GTCATCGGGT 6060 

CAACTGTGAC CGGGGGCTGA GGCCTTCGTG CCCTAACAGC CAGTCCCCTG TTAAAGTGGA 6120 

AGAGACCTGT GGCTGCCGCT GGACCTGCCC CTGCGTGTGC ACAGGCAGCT CCACTCGGCA 61 BO 

CATCGTGACC TTTGATGGGC AGAATTTCAA GCTGACTGGC AGCTGTTCTT ATGTCCTATT 6240 

5 TCAAAACAAG GAGCAGGACC TGGAGGTGAT TCTCCATAAT GGTGCCTGCA GCCCTGGAGC 6300 

AAGGCAGGGC TGCATGAAAT CCATCGAGGT GAAGCACAGT GCCCTCTCCG TCGAGCTGCA 6360 

CAGTGACATG GAGGTGACGG TGAATGGGAG ACTGGTCTCT GTTCCTTACG TGGGTGGGAA 6420 

CATGGAAGTC AACGTTTATG GTGCCATCAT GCATGAGGTC AGATTCAATC ACCTTGGTCA 6480 

CATCTTCACA TTCACTCCAC AAAACAATGA GTTCCAACTG CAGCTCAGCC CCAAGACTTT 6540 

10 TGCTTCAAAG ACGTATGGTC TGTGTGGGAT CTGTGATGAG AACGGAGCCA ATGACTTCAT 6600 

GCTGAGGGAT GGCACAGTCA CCACAGACTG GAAAACACTT GTTCAGGAAT GGACTGTGCA 6660 

GCGGCCAGGG CAGACGTGCC AGCCCATCCT GGAGGAGCAG TGTCTTGTCC CCGACAGCTC 6720 

CCACTGCCAG GTCCTCCTCT TACCACTGTT TGCTGAATGC CACAAGGTCC TGGCTCCAGC 6780 

CACATTCTAT GCCATCTGCC AGCAGGACAG TTGCCACCAG GAGCAAGTGT GTGAGGTGAT 6840 

15 CGCCTCTTAT GCCCACCTCT GTCGGACCAA CGGGGTCTGC GTTGACTGGA GGACACCTGA 6900 

TTTCTGTGCT ATGTCATGCC CACCATCTCT GGTCTACAAC CACTGTGAGC ATGGCTGTCC 6960 

CCGGCACTGT GATGGCAACG TGAGCTCCTG TGGGGACCAT CCCTCCGAAG GCTGTTTCTG 7020 

CCCTCCAGAT AAAGTCATGT TGGAAGGCAG CTGTGTCCCT GAAGAGGCCT GCACTCAGTG 7080 

CATTGGTGAG GATGGAGTCC AGCACCAGTT CCTGGAAGCC TGGGTCCCGG ACCACCAGCC 7140 

20 CTGTCAGATC TGCACATGCC TCAGCGGGCG GAAGGTCAAC TGCACAACGC AGCCCTGCCC 7200 

CACGGCCAAA GCTCCCACGT GTGGCCTGTG TGAAGTAGCC CGCCTCCGCC AGAATGCAGA 7260 

CCAGTGCTGC CCCGAGTATG AGTGTGTGTG TGACCCAGTG AGCTGTGACC TGCCCCCAGT 7320 

GCCTCACTGT GAACGTGGCC TCCAGCCCAC ACTGACCAAC CCTGGCGAGT GCAGACCCAA 7380 

CTTCACCTGC GCCTGCAGGA AGGAGGAGTG CAAAAGAGTG TCCCCACCCT CCTGCCCCCC 7440 

25 GCACCGTTTG CCCACCCTTC GGAAGACCCA GTGCTGTGAT GAGTATGAGT GTGCCTGCAA 7500 

CTGTGTCAAC TCCACAGTGA GCTGTCCCCT TGGGTACTTG GCCTCAACCG CCACCAATGA 7560 

CTGTGGCTGT ACCACAACCA CCTGCCTTCC CGACAAGGTG TGTGTCCACC GAAGCACCAT 7620 

CTACCCTGTG GGCCAGTTCT GGGAGGAGGG CTGCGATGTG TGCACCTGCA CCGACATGGA 7680 

GGATGCCGTG ATGGGCCTCC GCGTGGCCCA GTGCTCCCAG AAGCCCTGTG AGGACAGCTG 7740 

30 TCGGTCGGGC TTCACTTACG TTCTGCATGA AGGCGAGTGC TGTGGAAGGT GCCTGCCATC 7800 

TGCCTGTGAG GTGGTGACTG GCTCACCGCG GGGGGACTCC CAGTCTTCCT GGAAGAGTGT 7860 

CGGCTCCCAG TGGGCCTCCC CGGAGAACCC CTGCCTCATC AATGAGTGTG TCCGAGTGAA 7920 

GGAGGAGGTC TTTATACAAC AAAGGAACGT CTCCTGCCCC CAGCTGGAGG TCCCTGTCTG 7980 

CCCCTCGGGC TTTCAGCTGA GCTGTAAGAC CTCAGCGTGC TGCCCAAGCT GTCGCTGTGA 8040 

35 GCGCATGGAG GCCTGCATGC TCAATGGCAC TGTCATTGGG CCCGGGAAGA CTGTGATGAT 8100 

CGATGTGTGC ACGACCTGCC GCTGCATGGT GCAGGTGGGG GTCATCTCTG GATTCAAGCT 8160 

GGAGTGCAGG AAGACCACCT GCAACCCCTG CCCCCTGGGT TACAAGGAAG AAAATAACAC 8220 

AGGTGAATGT TGTGGGAGAT GTTTGCCTAC GGCTTGCACC ATTCAGCTAA GAGGAGGACA 8280 

GATCATGACA CTGAAGCGTG ATGAGACGCT CCAGGATGGC TGTGATACTC ACTTCTGCAA 8340 

40 GGTCAATGAG AGAGGAGAGT ACTTCTGGGA GAAGAGGGTC ACAGGCTGCC CACCCTTTGA 8400 

TGAACACAAG TGTCTGGCTG AGGGAGGTAA AATTATGAAA ATTCCAGGCA CCTGCTGTGA 8460 

CACATGTGAG GAGCCTGAGT GCAACGACAT CACTGCCAGG CTGCAGTATG TCAAGGTGGG 8520 

AAGCTGTAAG TCTGAAGTAG AGGTGGATAT CCACTACTGC CAGGGCAAAT GTGCCAGCAA 8580 

AGCCATGTAC TCCATTGACA TCAACGATGT GCAGGACCAG TGCTCCTGCT GCTCTCCGAC 8640 

45 ACGGACGGAG CCCATGCAGG TGGCCCTGCA CTGCACCAAT GGCTCTGTTG TGTACCATGA 8700 

GGTTCTCAAT GCCATGGAGT GCAAATGCTC CCCCAGGAAG TGCAGCAAG T GAG GCTGCTG 8760 

CAGCTGCATG GGTGCCTGCT GCTGCCTGCC TTGGCCTGAT GGCCAGGCCA GAGTGCTGCC 8820 

AGTCCTCTGC ATGTTCTGCT CTTGTGCCCT TCTGAGCCCA CAATAAAGGC TGAGCTCTTA 8880 
TCTTGCTGCA TGTTCTGCTC TTGTGCCCTT CTGAGCCCAC AAT 
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55 1 11 21 31 41 51 

I I I I I I 

MIPARFAGVL LALALILPGT LCAEGTRGRS STARCSLFGS DFVNTFDGSM YSFAGYCSYL 60 

LAGGCQKRSF SIIGDFQNGK RVSLSVYLGE FFDIHLiFVNG TVTQGDQRVS MPYASKGLYL 120 

ETBAGYYKLS GEAYGFVARI DGSGNFQVLL SDRYFNKTCG LCGNFNIFAE DDFMTQEGTL 180 

60 TSDPYDFANS WALSSGEQWC ERASPPSSSC NISSGEMQKG LWEQCQLLKS TSVFARCHPL 240 

VDPEPFVALC EKTLCECAGG LECACPALLE YARTCAQEGM VLYGWTDHSA CSPVCPAGME 300 

YRQCVSPCAR TCQSLHINEM CQERCVDGCS CPEGQLLDEG LCVESTECPC VHSGKRYPPG 360 

TSLSRDCNTC ICRNSQWICS NEECPGECLV TGQSHFKSFD NRYFTFSGIC QYLLARDCQD 420 

HSFSIVIETV QCADDRDAVC TRSVTVRLPG LHNSLVKLKH GAGVAMDGQD IQLPLLKGDL 480 

65 RIQHTVTASV RLSYGEDLQM DWDGRGRLLV KLSPVYAGKT CGLCGNYNGN QGDDFLTPSG 540 

LAEPRVEDFG NAWKLHGDCQ DLQKQHSDPC ALNPRMTRFS EEACAVLTSP TFEACHRAVS 600 

PLPYLRNCRY DVCSCSDGRE CLCGALASYA AACAGRGVRV AWREPGRCEL NCPKGQVYLQ 660 

CGTPCNLTCR SLSYPDEECN EACLEGCFCP PGLYMDERGD CVPKAQCPCY YDGEIFQPED 720 

IFSDHHTMCY CEDGFMHCTM SGVPGSLLPD AVLSSPLSHR SKRSLSCRPP MVKLVCPADN 780 

70 LRAEGLECTK TCQNYDLECM SMGCVSGCLC PPGMVRHENR CVALERCPCF HQGKEYAPGB 840 

TVKIGCNTCV CRDRKWNCTD HVCDATCSTI GMAHYLTFDG LKYLFPGECQ YVLVQDYCGS 900 

NPGTFRIIiVG NKGCSHPSVK CKKRVTILVE GGEIELFDGE VNVKRPMKDE THFEWESGR 960 

YIILLLGKAL SWWDRHLSI SWLKQTYQE KVCGLCGNFD GIQNNDLTSS NLQVEEDPVD 1020 

FGNSWKVSSQ CADTRKVPLD SSPATCHNNI MKQTMVDSSC RILTSDVFQD CNKLVDPEPY 1080 

75 LDVCIYDTCS CESIGDCACF CDTIAAYAHV CAQHGKWTW RTATLCPQSC EERNLRENGY 1140 

ECEWRYNSCA PACQVTCQHP EPLACPVQCV EGCHAHCPPG KILDELLQTC VDPEDCPVCE 1200 
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15 



20 



25 



VAGRRFASGK 
DISEPPLHDF 
YHDGSHAYIG 
ALLLMASQEP 
SSVDELEQQR 
FVLEGSDKIG 
ILQRVREIRY 
GDIQWPIGV 
SPAPDCSQPL 
IDVPWNWPE 
TDVSVDSVDA 
VTLGNSFLHK 
RGLRPSCPNS 
EQDLEVILHN 
NVYGAIMHEV 
GTVTTDWKTL 
AICQQDSCHQ 
DGNVSSCGDH 
CTCLSGRKVN 
ERGLQPTLTN 
STVSCPLGYIi 
MGLRVAQCSQ 
WASPENPCLI 
ACMIiNGTVIG 
CGRCLPTACT 
CLAEGGKIMK 
SIDINDVQDQ 



KVTLNPSDPE 
YCSRLLDLVP 
LKDRKRPSEL 
QRMSRNFVRY 
DEIVSYLCDL 
EADFNRSKEF 
QGGNRTNTGIi 
GPNANVQELE 
DVILLLDGSS 
KAHLLSLVDV 
AADAARSNRV 
LCSGFVRICM 
QSPVKVEETC 
GACSPGARQG 
RFNHLGHIFT 
VQEWTVQRPG 
EQVCEVIASY 
PSEGCFCPPD 
CTTQPCPTAK 
PGECRPNFTC 
ASTATNDCGC 
KPCEDSCRSG 
NECVRVKEEV 
PGKTVMIDVC 
IQLRGGQIMT 
IPGTCCDTCE 
CSCCSPTRTE 



HCQICHCDW 
LLDGSSRLSE 
RRIASQVKYA 
VQGLKKKKVI 
APEAPPPTLP 
MEEVIQRMDV 
ALRYLSDHSF 
RIGWPNAPIL 
SFPASYFDEM 
MQREGGPSQI 
TVFPIGIGDR 
DEDGNEKRPG 
GCRWTCPCVC 
CMKSIEVKHS 
FTPQNNEFQL 
QTCQPILEEQ 
AHLCRTNGVC 
KVMLEGSCVP 
APTCGLCEVA 
ACRKEECKRV 
TTTTCLPDKV 
FTYVLHEGEC 
FIQQRNVSCP 
TTCRCMVQVG 
LKRDETLQDG 
EPECNDITAR 
PMQVALHCTN 



NLTCEACQEP 
AEFEVLKAFV 
GSQVASTSEV 
VIPVGIGPHA 
PHMAQVTVGP 
GQDSIHVTVL 
LVSQGDREQA 
IQDFETLPRE 
KSFAKAFISK 
GDALGFAVRY 
YDAAQLRILA 
DVWTLPDQCH 
TGS3TRHIVT 
ALSVELHSDM 
QLSPKTFASK 
CLVPDSSHCQ 
VDWRTPDFCA 
EEACTQCIGE 
RLRQNADQCC 
SPPSCPPHRL 
CVHRSTIYPV 
CGRCLPSACE 
QLEVPVCPSG 
VTSGFKLECR 
CDTHFCKVNE 
LQYVKVGSCK 
GSWYHEVLN 



GGLWPPTDA 
VDMMERLRIS 
LKYTIiFQIFS 
NIJCQIRLIEK 
GLLGVSTLGP 
QYSYMVTVEY 
PNLVYMVTGN 
APDLVLQRCC 
ANIGPRLTQV 
LTSEMHGARP 
GPAGDSNWK 
TVTCQPDGQT 
FDGQNFKLTG 
EVTVNGRLVS 
TYGLCGICDE 
VLLLPLFAEC 
MSCPPSLVYN 
DGVQHQFLEA 
PEYECVCDPV 
PTLRKTQCCD 
GQFWEEGCDV 
WTGSPRGDS 
FQLSCKTSAC 
KTTCNPCPLG 
RGEYFWEKRV 
SEVEVDIHYC 
AMECKCSPRK 



PVSPTTLYVE 
QKWVRVAWE 
KIDRPEASRI 
QAPENKAFVti 
KRNSMVLDVA 
PFSEAQSKGD 
PASDEIKRLP 
SGEGLQIPTL 
SVLQYGSITT 
GASKAWILV 
LQRIEDLPTM 
LLKSHRVNCD 
SCSYVLFQNK 
VPYVGGNMEV 
NGANDFMLRD 
HKVLAPATFY 
HCEHGCPRHC 
WVPDHQPCQI 
SCDLPPVPHC 
EYECACNCVN 
CTCTDMEDAV 
QSSWKSVGSQ 
CPSCRCERME 
YKEENNTGEC 
TGCPPFDEHK 
QGKCASKAMY 
CSK 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



30 Seq ID NO: 150 DNA sequence 

Nucleic Acid Accession #: NM_001508.1 

Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons) 



35 



40 



45 



50 



55 



60 



ATG GCTTCAC 
CCCGAGTTTG 
TTCGTGATGG 
AAAGGATACT 
TTGGTGTTCC 
ACGTCCAGCT 
GCTACGCTGC 
TTCAGGTACA 
GTCACCTCCG 
GTGAACGTGC 
CAGCCCGAGA 
CAGTCCAGCA 
ATGTGCTGGA 
ACGCGGCCTC 
ACCATCATCT 
ATTCGGAGGA 
GCGTACATGA 
CCGCTCCTGT 
TGCCGCCTGT 
ACCACCGACA 
TCTGCAAGGA 
TCTAAGTCCC 
AATTCTGCTG 



11 
I 

CCAGCCTCCC 
AGGTGGCCAC 
GCCTTCTGGG 
TGCAGAAGGA 
TCATCX3GCAT 
ACACCCTGTC 
TGCACGTGCT 
AGGCTGTGTC 
CCCTGGTGGC 
CCAGCCACCG 
CCTCCAATAT 
TCTTCGGCGC 
ACATGATGCA 
CGCAGCTGAG 
TCCTGAGGCT 
TCATGGCTGC 
TCCTCCTCCC 
ACACGGTGTC 
CGCTGCAGCA 
GCGCCCXJCTT 
GAACTGAGAA 
AGTCATTGAG 
CAGAGAATGG 



21 
I 

GGGCAGTGAC 
CTGGATCAAA 
GAACAGCGTC 
GGTGACAGAC 
GCCCATGGAG 
CTGCAAGCTG 
GACGCTCAGC 
GGGACCTTGC 
ACTGCCCTTG 
GGGTCTCACT 
GTCCATCTGT 
CTTCGTGGTC 
GGTGCTCATG 
GAAGTCCGAG 
GATTGTTGTG 
GGCCAAACCC 
CTTCTCGGAG 
CTCGCAGCAG 
CGCCAACCAC 
TGTGCAGCGC 
GATTTTCTTA 
TCTCGAGTCA 
TTTTCAGGAG 



31 



41 



TGCTCCCAAA TCATTGATCA 
ATCACCCTTA TTCTGGTGTA 
ACCATTCGGG TCACCCAGGT 
CACATGGTGA GTTTGGCTTG 
TTCTACAGCA TCATCTGGAA 
CACACTTTCC TCTTCGAGGC 
TTTGAGCGCT ACATCGCCAT 
CAGGTGAAGC TGCTGATTGG 
CTGTTTGCCA TGGGTACTGA 
TGCAACCGCT CCAGCACCCG 
ACCAACCTCT CCAGCCGCTG 
TACCTCGTGG TCCTGCTCTC 
AAAAGCCAGA AGGGCTCGCT 
AGCGAAGAGA GCAGGACCGC 
ACATTGGCCG TATGCTGGAT 
AAGCACGACT GGACGAGGTC 
ACGTTTTTCT ACCTCAGCTC 
TTTCGGCGGG TGTTCGTGCA 
GAGAAGCGCC TGCGCGTACA 
CCGTTGCTCT TCGCGTCCCG 
AGCACTTTTC AGAGCGAGGC 
CTAGAGCCCA ACTCAGGCGC 
CATGAAGTT T GA 



51 
I 

CAGTCATGTC 
CCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCGTCTGG 
GTACCCCCTG 
CCACCACGAG 
GACCGTGTTC 
CGTAGCCTTC 
GGCCGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CTACTTCCGG 
GGTCATCAAC 
GGTGCTGTGC 
TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID No: 151 Protein sequence: 
Protein Accession #: NP 001499.1 



65 



70 



75 



i 
I 

MASPSLPGSD 
KGYLQKEVTD 
ATLIiHVLTLS 
VNVPSHRGLT 
MCWNMMQVLM 
IRRIMAAAKP 
CRLSLQHANH 
SKSQSLSLES 



11 
I 

CSQIIDHSHV 
HMVSLACSDI 
FERYIAICHP 
CNRSSTRHHE 
KSQKGSLAGG 
KHDWTRSYFR 
EKRLRVHAHS 
LEPNSGAKPA 



21 
I 

PEFEVATWIK 
LVFLIGMPME 
FRYKAVSGPC 
QPETSNMSIC 
TRPPQLRKSE 
AYMILLPFSE 
TTDSARFVQR 
NSAAENGFQE 



31 

I 

ITLILVYLII 
FYSIIWNPLT 
QVKLLIGFVW 
TNLSSRWTVF 
SEESRTARRQ 
TFFYLSSVIN 
PLLFASRRQS 
HEV 



41 

I * 

FVMGLLGNSV 

TSSYTLSCKL 

VTSALVALPL 

QSSIFGAFW 

TIIFLRLIW 

PLLYTVSSQQ 

SARRTEKIFL 



51 
I 

TIRVTQVLQK 
HTFLFEACSY 
LFAMGTEYPL 
YLWLLSVAF 
TLAVCWMPNQ 
FRRVFVQVLC 
STFQSEAEPQ 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 152 DNA sequence 



253 



WO 02/079492 



PCTAJS02/04915 



Nucleic Acid Accession #: none found 

Coding sequence: 3-65 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

TTATTATTTT GTGTAAACTA TATTCTGCTT 
CTTGARAAGT ATTCCAAGGA ATATTATGAA 
CATGTAATTG AAATTCATGC AAGGAAACAA 
CACATGTTAT CAACCTCGTA ACTTTTGGTG 
CCCGCTCTTT GGGAATGCTA CATACCCATT 
TAAATGACAA AACACAGCAG TGTTTTGAGG 
ACATACTACC AGAGCTGCTT GGAAAATTAA 
ATACGTTTAA ATTTGCCCTA GGATTSAGCT 
GAGAGAAAGA AAGGAGKAAA CAGTGGTAAT 
AATATTAATA AATCATAATA TGACAAGACC 
GAAGCCAGCT GTTGGTAGGC ATTAATGAGT 
CACTATTTTG CCAAAACCAA AGTAATTATA 
GTGGTTATAA AGGGCATATT TACATAAATT 
GTAACCCAAT TTTACTTCTT TAAAAAGTCT 
AATCAACTAG ACAGTGGTTT GTTAAATTTA 
GTCATGAGTT CTTGAATCCC AGAGAAATAA 
TTGGCCTAGA GAAGTGGCCA TTTTATCAAC 
CCCGTTGCCT TCTGAAAAAC AGCAAGTTAT 
ATGGAAAATT AATTTATTAA TTAATAGCCT 
ATATTTTGAG ATAAAATGTT GAATAAAACC 
TGTAATATTT AATTATTTTA TAAGTTTTAT 



31 . 41 51 

I I I 

ATAGAGAGTC TCTGAGACTA AAATTGACAA 60 

AATAGGGCAA CATGGACTGT TTAAGATCTC 120 

CT CAT AG AAA AGATAAATAT GGATGCCCTT 180 

CTTGCTGAAT CAGTCCATGA AAAGCTACAG 240 

TCTGGTATTT AAAAAATATC TAGGAGGAGC 300 

GAGAAAGGAC CATCATTTAT AATGCTCTGT 360 

AGGCCACTTG TGGCTTTTTC CTACCAACTG 420 

AACAGCAAAA AAAAAAAAAA AAAAAAAARA 480 

AAAAAAATCC ATCTGTCTTC TTGCTATGTT 540 

CTCACTGAAT AAGAGTATTT TCAGTCATCA 600 

TTAAAATTGT TCTCAATTGA AAAAACATCA 660 

ATACTGTGTC CTCCTGTAAT TTTTTGAGAA 720 

CTACTTTATT CCTCAACTTC TTTGATGAAT 780 

CAATTCAAGC TGGATTAGCC AGCTCAGCAT 840 

GCAGCATACT TCGTTCCCAT TCTAATTAAA 900 

TGCTTAGGAA CTTCTCTCAA TCTGCTTGGC 960 

AGGRAAAAAA AAAATTTTCT CTACTACAAC 1020 

TTCTTTATAT AATTATCATT TTATTATTTT 1080 

ATTATGTGTT CTCACTTGCT TCTCTAAGTA 1140 

ATGGATTATA GAGAAAAGTC AAAATATATG 1200 
AATAAAGTAT TCCATTTCTT TATCTT 



Seq ID No: 153 Protein sequence: 
Protein Accession #: none found 



1 11 21 31 41 51 

I I I I I I 

IILCKLYSAY RESLRLKLTT 

Seq ID NO: 154 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-36 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CTGGATGATA TGGAAGAAAT GGATGGGTTA AGG TAAA AGG CTGATCACAG ATGGGTTCCT 60 

CTCAAGGTTA AAATAGTTTA AGTGCCAGAA GAAAAGGTGG GCACCAGCGA ATTAAGAACC 120 

ATCTTTGAAT GGTCCCCTTG GTTAAATACT TAACTTTTGT CATCAGTGTC TGCATTTATG 180 

AAATGAAGAG GAATTCACTA ATATGCTACG TGATCTTTTG TTTGTCATGA AAAGAGTTAC 240 

TGTTGTGTAG TTCTCTGTTC CAGGGCTGCC TTTGCTCCAC AAAGCACTGA GAAGCAGTGG 300 

CCCTGTACAA CCATACTGCC TCTCAACACT GTGTAATAGG CTAACACCGC CCAGCGAACC 360 
TTCCTGGGAG ATATAAAATA CATAGGTTTA GGCTGGCAAA AAAAAAAAAA AAA 

Seq ID NO: 155 Protein sequence: 
Protein Accession #: none found 



1 11 21 31 41 51 

I I.I I I I 

LDDMEEMDGL R 



Seq ID NO: 156 DNA sequence 

Nucleic Acid Accession #: NMJ)32961.1 

Coding sequence: 827-3949 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CAGGCTCAGA GGCTGAAGCA GGAGGAAGGA AGGACTGGAA GGAAAAAGAG ACAGGTTAGA 60 

GGGAAAGAGG CTTGGGAAGA AAACAGCAGA AAAGAAACTG CTCATTACAC TTACAGAGAG 120 

GCAAGTAACG GTGGAGATGA GGACAGAGGG AACCAAGACT CTGAAAGACA AAAAATACAA 180 

ATAGAGCGAA AGAGGAAAAA AATGTCAAGA AGAACATCCA TCCGGAGAAA TGAAGAGAAT 240 

GAAAGTTTTA AACTGCAGAG CCGTTCTGTG CTTTTCCGGC ACAAAATTAT ATCGCTGATT 300 

TTAAGCCCTT TTGCATTTGC CAGCCGTTGA CATTAAGAGG CATGTTTAAC GGTGCCAACA 360 

GCATCTCCTT TTCCTTCTCC TCTTCCTCTT CTTCTTCTTC CTCCTCCTCC TCCTCTTTTT 420 



254 



WO 02/079492 



PCT/US02/04915 



CCTCCTCCTC GTTCTCCTCC CATCAGCAAG 
GAAATTTCCT CTTTGGGATT TGCCAGCGCC 
TGTATTATTG TTATTTTATT AATTAGTCAG 
CCTGTCACCC TTCCTGTGCT AAGATTTAAA 
5 AAAATGAAGC AAAAGGAGTA AGATTTTTAA 
CGCACTTTTA TTTGTATTTT TTCAGATTTT 
TGGGTGGCTG ACTGGCTGCG GGAAGCTACT 
ATTGTTTGCC TTGCTCTGGA TGGTGGAAGG 
GGAGGAGCAG GAACATGGCA CTTTCGTGGG 

10 TACAAAACTT TCGGCTCGCG GGTTTCAGAC 
CCTCAACCTG GAGACAGGGG TGCTGTACGT 
CAAACAGAGC CCCTCCTGTG TCCTGCACCT 
GTTCCAGGTG GAGATCGAGG TGCTGGACAT 
AGACCTGACG GTGGAAATCT CTGAGAGCGC 

15 CGCATTCGAC CCAGACGTGG GCACCAACTC 
CTACTTCTCC CTGGACGTGC AGACCCAGGG 
GGAGAAGCCA CTGGACCGAG AGCAGCAAGC 
CGGAGGAGGT GGGGGAGGAG TAGGAGAAGG 
CCCCCAGCAG CAGCGCACCG GCACGGCCCT 

20 CAATGTGCCC GCTTTCGACC AACCCGTCTA 
AGGCACTCTC GTGATCCAGC TCAACGCCAC 
CGTGTACTCC TTCAGCAGCC ACATTTCGCC 
GCGCACTGGC AGACTGGAGG TAAGCGGCGA 
AGTGTACGTG CAAGCCAAGG ACCTGGGCCC 

25 AGTGCGAGTA CTGGATGCTA ATGACAACGC 
AGCGGTGAGT GAGGGCGCGG CGCCCGGCAC 
CGACTCAGAG GAGAATGGGC AGGTGCAGTG 
CAAGTCTTCC TTTAAGAATT ACTACACCAT 
GGGGGACTCC TACACCCTGA CTGTAGTGGC 

30 CAGTAAGTCG ATCCAGGTAC AAGTGTCGGA 
GCCGGTCTAC GACGTGTATG TGACTGAAAA 
GAGCGCCACC GACCGGGATG AGGGCGCCAA 
CCAGATCCAG GGCATGAGCG TCTTCACCTA 
GTACGCCCTG CGCTCCTTCG ACTATGAGCA 

35 CCGGGACGCT GGCAGCCCCC AGGCGCTGGC 
GGATCAAAAT GACAACGCCC CTGCCATCGT 
AGCGCGTGAG GTGCTGCCCC GCTCGGCGGA 
CGTGGACGCG GACGACGGCG AGAACGCCCG 
AATGAACCTC TTTCGCATGG ACTGGCGCAC 

40 GGCCAAGCGC GACCCCCAGC GGCCTTATGA 
GCCGCCCCTT TCCTCCACCG CCACCCTGGT 
CCAGGGCGGG GGCGGGAGCG GAGGCGGAGG 
TGGCGGCGGG GAAACCTCGC TAGACCTCAC 
GTCCTTCATC TTCCTGCTGG CCATGATCGT 

45 GCTCAACATC TATACTTGTC TGGCCAGCGA 
CGGAGGTTCG ACCTGCTGTG GCCGCCAAGC 
AGACATCATG CTGGTGCAGA GCTCCAATGT 
GGAGTCCGGG GGCTTTGGCT CCCACCACCA 
GACCCCTGAG TCCGCCAAGA CCGACCTGAT 

50 TACGGACACT GAGCACAACC CCTGCGGGGC 
TGATATCATC TCCAACGGAA GCATTTTGTC 
CAGCTATCTA GTTGACAGAC CTCGCCGAGT 
AGTAAGCTCT AAGGACAGTG GTCATGGAGA 
CACCAACCGT GCCCAGTCAG CTGGTATGGA 

55 AGCTCTGGGC CACTCAGATC GGTGCTGGAT 
GGCTGCTGAT TATCGCAGCA ATCTGCATGT 
GGTGTTTGAA ACTCCAGAAG CCCAGCCTGG 
AGAGAAGGCC CTTCACAGCA CTCTGGAGAG 
GCGAGCGCCT TACAAACCAC CATATTTGAC 

60 GGACTTACCT GAAGCAGCAT GATTTGCACA 
ACTTCATTAT CTTGGCCATC CAGTTAGTCA 
GTCATCATGG CCAATTATAG GACCTAATTG 
TGTGCAGAAC TGTAGAAACT TTAGAGGCAA 
TGTTTACAGC ACTATCTATC TTTCTCTCTC 

65 ATTCACCACG AGAAGCCAGT CATAAAGATA 
CACTGTTTTA AACTTGACTG TTTTATATTA 
TTCCAACTTT ACAAGAGAAA TTGTGATTAT 
TTGTATTCTG AAGACCCACA AAATATCAAA 
AAGTGTTTAC TGTACTATTT CAAAGCTTCT 

70 TATAATTTTC CTAAAATGTG GTACAACTCA 
CATCATACAA TAAAATAAAA GGTAATTCAG 
TCATTAATAG TTTTCTCCCA ATTTCCATAT 
AGAAAATGAT GCTCTAAGCT ACAAAATTTT 
AAAGATGTAG CTATTGATGT TATCAGACAG 

75 ACAATCTGCA TAAGTCTGAT TCTATTTCTA 
TTATAAAGAA TCGATAAATT CACCTGTATT 



AAGACAAACC GAGGACAGTC TTGAAATATC 480 

AAGACTGTCG GAATAAAGGA CGCTGACTAT 540 

TGGAAAGATT ACAGATGAGG AAAGGGGACG 600 

AAAAAATGAG GCTGGATTGC GGGAAGCTCT 660 

AGACAGAAAG CCACAGGAGC CCCCACGTAG 720 

TTTTTGTTTC GTGGTGGTGG GGGAGGTGAT 780 

TCCTTTCCTT TTGGAGATGA TTGTGCTATT 840 

AGTCTTTTCC CAGCTTCACT ACACGGTACA 900 

GAATATCGCT GAAGATCTGG GTCTGGACAT 960 

GGTGCCCAAC TCAAGGACCC CTTACTTAGA 1020 

GAACGAGAAA ATAGACCGCG AACAAATCTG 1080 

GGAGGTCTTT CTGGAGAACC CCCTGGAGCT 1140 

TAATGACAAC CCCCCCTCTT TCCCGGAGCC 1200 

CACGCCAGGC ACTCGCTTCC CCTTGGAGAG 1260 

CTTGCGCGAC TACGAGATCA CCCCCAACAG 1320 

GGATGGCAAC CGATTCGCTG AGCTGGTGCT 1380 

GGTGCACCGC TACGTGCTGA CCGCGGTGGA 1440 

AGGGGGAGGT GGCGGGGGAG CAGGCCTGCC 1500 

ACTCACCATC CGAGTGCTGG ACTCCAATGA 1560 

CACTGTGTCC CTACCAGAGA ACTCTCCCCC 1620 

CGACCCGGAC GAGGGCCAGA ACGGTGAGGT 1680 

CCGGGCGCGG GAGCTTTTCG GACTCTCGCC 1740 

GTTGGACTAT GAAGAGAGCC CAGTGTACCA 1800 

CAACGCCGTG CCTGCGCACT GCAAGGTGCT 1860 

GCCAGAGATC AGCTTCAGCA CCGTGAAGGA 1920 

TGTGGTGGCC CTTTTCAGCG TGACTGACCG 1980 

CGAGCTACTG GGAGACGTGC CTTTCCGCCT 2040 

CGTTACCGAA GCCCCCCTGG ACCGAGAGGC 2100 

TCGGGACCGG GGCGAGCCTG CGCTCTCCAC 2160 

TGTGAACGAC AACGCGCCGC GTTTCAGCCA 2220 

CAACGTGCCT GGCGCCTACA TCTACGCGGT 2280 

CGCCCAGCTT GCCTACTCTA TCCTCGAGTG 2340 

CGTTTCTATC AACTCTGAGA ACGGCTACTT 2400 

GCTGAAGGAC TTCAGTTTTC AGGTGGAAGC 2460 

TGGTAACGCC ACTGTCAACA TCCTCATAGT 2520 

GGCGCCTCTA CCAGGGCGCA ACGGGACTCC 2580 

GCCGGGTTAC CTGCTCACCC GCGTGGCCGC 2640 

GCTCACTTAC AGCATCGTGC GTGGCAACGA 2700 

CGGGGAGCTG CGCACAGCAC GCCGAGTCCC 2760 

GCTGGTGATC GAGGTGCGCG ACCATGGGCA 2820 

GGTTCAGCTG GTGGATGGCG CCGTGGAGCC 2880 

GTCAGGAGAG CACCAGCGCC CCAGTCGCTC 2940 

CCTCATCCTC ATCATCGCGT TGGGCTCGGT 3000 

GCTGGCCGTG CGTTGCCAAA AAGAGAAGAA 3060 

TTGCTGCCTC TGCTGCTGCT GCTGCGGTGG 3120 

CCGGGCGCGC AAGAAGAAAC TCAGCAAGTC 3180 

ACCCAGTAAC CCGGCCCAGG TGCCGATAGA 3240 

CAACCAGAAT TACTGCTATC AGGTATGCCT 3300 

GTTTCTTAAG CCCTGCAGCC CTTCGCGGAG 3360 

CATCGTCACC GGTTACACCG ACCAGCAGCC 3420 

CAACGAGACT AAACACCAGC GAGCAGAGCT 3480 

TAACAGTTCT GCATTCCAGG AAGCCGACAT 3540 

CAGTGAACAG GGAGATAGTG ATCATGATGC 3600 

TCTCTTCTCC AATTGCACTG AGGAATGTAA 3660 

GCCTTCTTTT GTCCCTTCTG ATGGACGCCA 3720 

TCCTGGCATG GACTCTGTTC CAGACACTGA 3780 

GGCAGAGCGG TCCTTTTCCA CCTTTGGCAA 3840 

GAAGGAGCTG GATGGACTGC TGACTAATAC 3900 

ACGGAAAAGG ATATGCTAGT CAATTCTACA 3960 

AAGTCGACCA ACAAAAGCAT CAACTTTTCA 4020 

TGTGTAACTG AGTATTAGAT TTCGGATGGA 4080 

CTCTCAGCAG GCCTGAGAAA TGAGTTGAAA 4140 

CAGATTTTGC CTCCCCGATC AGTGTGTGCC 4200 

CAAATGTCAC TGAGCCCTTT AGATGTTTAT 4260 

AAGGAAATTT GTGCATTATA AATGCAATAT 4320 

TTTTTGTGTG ATCAAGTGTT CCGCAAGCTA 4380 

GTTCTTTTCA CCTGTGGGTT ATAAAAAATG 4440 

GACATTCTGT AGTTTATACA CCGTGTTGCA 4500 

AAATAAATAT AAAATATATA TATTATATTA 4560 

GTTGGTTTTT AAATGGATGC ATACAGTCCA 4620 

GGTCCCAAAG ACAAACTTAC TAAGAAAAAA 4680 

CTTACTCAAC CGTGTTTTTC CTTGTTTAAA 4740 

GTCAAAAACT CATATTGAAT TTTCAATGCC 4800 

AGCACTGACT ATGTACTATC AAACTATCTA 4860 

TGACTTTGAA TTTAGAATCA CTTAAAGCTT 4920 

TGTTGTTAGA AAAAAACTGG GTGTCTGTAC 4980 



255 



WO 02/079492 



PCT/US02/04915 



ATTTTGTGGT 
ACTCACACAG 
TATAATTCTA 
AAAATATGAA 
TTCAGTATTC 
AATTCTTCAT 
TTAATTATTA 



GTAAAATATG 
AATTTTATTT 
TATGAATATA 
AGCTCTAAAT 
CATGTAAAAA 
GAGTCAGCCT 
AATTTAGTAA 



TAATTGAAGA 
TACATAGTTT 
TAGAGATATA 
TTAAAATAAA 
TTTTATAGCT 
TCAAAAGTTA 
GACGCAAAAA 



TTACTATTTT AAGAAGTCAT CAGTCATATC 5040 

TGTGACTTAA TTACACATGA ATATAAAATC 5100 

GAAACATCTG AACTGGTAAA GAATAACTAT 5160 

TTTAGAGATA GAATCATGGT ACATTATTGT 5220 

TAAATGTAGT CAGTGTTTGA TTAATGAAAA 5280 

AGCTTGCCTT TTACTTTTAT GTCAACAATA 5340 
AAAAAAAAAA AAAA 



10 



Seq ID No: 157 Protein sequence : 
Protein Accession #: NP 116586.1 



15 



20 



25 



30 



MIVLLLFALL 
TPYLDLNLET 
SFPEPDLTVE 
AELVLEKPLD 
LDSNDNVPAF 
FGLSPRTGRL 
STVKEAVSEG 
LDREAGDSYT 
YIYAVSATDR 
FQVEARDAGS 
TRVAAVDADD 
RDHGQPPLSS 
ALGSVSFIFL 
KLSKSDIMLV 
SPSRSTDTEH 
QEADIVSSKD 
SDGRQAADYR 
LLTNTRAPYK 



11 
I 

WMVEGVFSQL 
GVLYVNEKID 
ISESATPGTR 
REQQAVHRYV 
DQPVYTVSLP 
EVSGELDYEE 
AAPGTWALF 
LTWARDRGE 
DEGANAQLAY 
PQALAGNATV 
GENARLTYSI 
TATLWQLVD 
LAMIVLAVRC 
QSSNVPSNPA 
NPCGAIVTGY 
SGHGDSEQGD 
SNLHVPGMDS 
PPYLTRKRIC 



21 
I 

HYTVQEEQEH 
REQICKQSPS 
FPLESAFDPD 
LTAVDGGGGG 
ENSPPGTLVI 
SPVYQVYVQA 
SVTDRDSEEN 
PALSTSKSIQ 
SILECQIQGM 
NILIVDQNDN 
VRGNEMNLFR 
GAVEPQGGGG 
QKEKKLNIYT 
QVPIEESGGF 
TDQQPDIISN 
SDHDATNRAQ 
VPDTEVFETP 



31 
I 

GTFVGNIAED 
CVLHLEVFLE 
VGTNSLRDYE 
GVGEGGGGGG 
QLNATDPDEG 
KDLGPNAVPA 
GQVQCELLGD 
VQVSDVNDNA 
SVFTYVSINS 
APAIVAPLPG 
MDWRTGELRT 
SGGGGSGEHQ 
CLASDCCLCC 
GSHHHNQNYC 
GSILSNETKH 
SAGMDLFSNC 
EAQPGAERSF 



41 

I 

LGLDITKLSA 
NPLELFOVEI 
ITPNSYFSLD 
GAGLPPQQQR 
QNGEWYSFS 
HCKVLVRVLD 
VPFRLKSSFK 
PRFSQPVYDV 
ENGYLYAIiRS 
RNGTPAREVI* 
ARRVPAKRDP 
RPSRSGGGET 
CCCGGGGSTC 
YQVCLTPESA 
QRAELSYLVD 
TEECKALGHS 
STFGKEKALH 



51 

I 

RGFQTVPNSR 
EVLDINDNPP 
VQTQGDGNRF 
TGTALLTIRV 
SHISPRAREL 
ANDNAPEISF 
NYYTIVTEAP 
YVTENNVPGA 
FDYEQLKDFS 
PRSAEPGYLIi 
QRPYELVIEV 
SLDLTLILII 
CGRQARARKK 
KTDLMFIiKPC 
RPRRVNSSAF 
DRCWMPSFVP 
STLERKEIiDG 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 158 DNA sequence 
35 Nucleic Acid Accession #: NM_022159.1 

Coding sequence: 70-1890 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

40 | | | | | | 

GTGAAATTTA AACTCCAGTC CTGTGGCGAA AATGCTAATT GCACTAACAC AGAAGGAAGT 60 

TATTATTGT A TG TGTGTACC TGGCTTCAGA TCCAGCAGTA ACCAAGACAG GTTTATCACT 120 

AATGATGGAA CCGTCTGTAT AGAAAATGTG AATGCAAACT GCCATTTAGA TAATGTCTGT 180 

ATAGCTGCAA ATATTAATAA AACTTTAACA AAAATCAGAT CCATAAAAGA ACCTGTGGCT 240 

45 TTGCTACAAG AAGTCTATAG AAATTCTGTG ACAGATCTTT CACCAACAGA TATAATTACA 300 

TATATAGAAA TATTAGCTGA ATCATCTTCA TTACTAGGTT ACAAGAACAA CACTATCTCA 360 

GCCAAGGACA CCCTTTCTAA CTCAACTCTT ACTGAATTTG TAAAAACCGT GAATAATTTT 420 

• GTTCAAAGGG ATACATTTGT AGTTTGGGAC AAGTTATCTG TGAATCATAG GAGAACACAT 480 

CTTACAAAAC TCATGCACAC TGTTGAACAA GCTACTTTAA GGATATCCCA GAGCTTCCAA 540 

50 AAGACCACAG AGTTTGATAC AAATTCAACG GATATAGCTC TCAAAGTTTT CTTTTTTGAT 600 

TCATATAACA TGAAACATAT TCATCCTCAT ATGAATATGG ATGGAGACTA CATAAATATA 660 

TTTCCAAAGA GAAAAGCTGC ATATGATTCA AATGGCAATG TTGCAGTTGC ATTTTTATAT 720 

TATAAGAGTA TTGGTCCTTT GCTTTCATCA TCTGACAACT TCTTATTGAA ACCTCAAAAT 780 

TATGATAATT CTGAAGAGGA GGAAAGAGTC ATATCTTCAG TAATTTCAGT CTCAATGAGC 840 

55 TCAAACCCAC CCACATTATA TGAACTTGAA AAAATAACAT TTACATTAAG TCATCGAAAG 900 

GTCACAGATA GGTATAGGAG TCTATGTGCA TTTTGGAATT ACTCACCTGA TACCATGAAT 960 

GGCAGCTGGT CTTCAGAGGG CTGTGAGCTG ACATACTCAA ATGAGACCCA CACCTCATGC 1020 

CGCTGTAATC ACCTGACACA TTTTGCAATT TTGATGTCCT CTGGTCCTTC CATTGGTATT 1080 

AAAGATTATA ATATTCTTAC AAGGATCACT CAACTAGGAA TAATTATTTC ACTGATTTGT 1140 

60 CTTGCCATAT GCATTTTTAC CTTCTGGTTC TTCAGTGAAA TTCAAAGCAC CAGGACAACA 1200 

ATTCACAAAA ATCTTTGCTG TAGCCTATTT CTTGCTGAAC TTGTTTTTCT TGTTGGGATC 1260 

AATACAAATA CTAATAAGCT CTTCTGTTCA ATCATTGCCG GACTGCTACA CTACTTCTTT 1320 

TTAGCTGCTT TTGCATGGAT GTGCATTGAA GGCATACATC TCTATCTCAT TGTTGTGGGT 1380 

GTCATCTACA ACAAGGGATT TTTGCACAAG AATTTTTATA TCTTTGGCTA TCTAAGCCCA 1440 

65 GCCGTGGTAG TTGGATTTTC GGCAGCACTA GGATACAGAT ATTATGGCAC AACCAAAGTA 1500 

TGTTGGCTTA GCACCGAAAA CAACTTTATT TGGAGTTTTA TAGGACCAGC ATGCCTAATC 1560 

ATTCTTGTTA ATCTCTTGGC TTTTGGAGTC ATCATATACA AAGTTTTTCG TCACACTGCA 1620 

GGGTTGAAAC CAGAAGTTAG TTGCTTTGAG AACATAAGGT CTTGTGCAAG AGGAGCCCTC 1680 

GCTCTTCTGT TCCTTCTCGG CACCACCTGG ATCTTTGGGG TTCTCCATGT TGTGCACGCA 1740 

70 TCAGTGGTTA CAGCTTACCT CTTCACAGTC AGCAATGCTT TCCAGGGGAT GTTCATTTTT 1800 

TTATTCCTGT GTGTTTTATC TAGAAAGATT CAAGAAGAAT ATTACAGATT GTTCAAAAAT 1860 

GTCCCCTGTT GTTTTGGATG TTTAAGGTAA ACATAGAGAA TGGTGGATAA TTACAACTGC 1920 

ACAAAAATAA AAATTCCAAG CTGTGGATGA CCAATGTATA AAAATGACTC ATCAAATTAT 1980 

CCAATTATTA ACTACTAGAC AAAAAGTATT TTAAATCAGT TTTTCTGTTT ATGCTATAGG 2040 

75 AACTGTAGAT AATAAGGTAA AATTATGTAT CATATAGATA TACTATGTTT TTCTATGTGA 2100 

AATAGTTCTG TCAAAAATAG TATTGCAGAT ATTTGGAAAG TAATTGGTTT CTCAGGAGTG 2160 
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ATATCACTGC ACCCAAGGAA AGATTTTCTT TCTAACACGA GAAGTATATG AATGTCCTGA 2220 

AGGAAACCAC TGGCTTGATA TTTCTGTGAC TCGTGTTGCC TTTGAAACTA GTCCCCTACC 2280 

ACCTCGGTAA TGAGCTCCAT TACAGAAAGT GGAACATAAG AGAATGAAGG GGCAGAATAT 2340 

CAAACAGTGA AAAGGGAATG ATAAGATGTA TTTTGAATGA ACTGTTTTTT CTGTAGACTA 2400 

GCTGAGAAAT TGTTGACATA AAATAAAGAA TTGAAGAAAC ACATTTTACC ATTTTGTGAA 2460 

TTGTTCTGAA CTTAAATGTC CACTAAAACA ACTTAGACTT CTGTTTGCTA AATCTGTTTC 2520 
TTTTTCTAAT ATTCTAAAA 



Seq ID No: 159 Protein sequence; ' 
Protein Accession #: NP_071442.1 



1 11 21 31 41 51 

I I I I I I 

MCVPGFRSSS NQDRFITNDG TVCIENVNAN CHLDNVCIAA NINKTLTKIR SIKEPVALLQ 60 

EVYRNSVTDL SPTDIITYIE ILAESSSLLG YKNNTISAKD TLSNSTLTEF VKTVNNFVQR 120 

DTFWWDKLS VNHRRTHLTK LMHTVEQATL RISQSFQKTT EFDTNSTDIA LKVFFFDSYN 180 

MKHIHPHMNM DGDYINIFPK RKAAYDSNGN VAVAFLYYKS IGPLLSSSDN FLLKPQNYDN 240 

SEEEERVISS VISVSMSSNP PTLYELEKIT FTLSHRKVTD RYRSLCAFWN YSPDTMNGSW 300 

SSEGCELTYS NETHTSCRCN HLTHFAILMS SGPSIGIKDY NILTRITQLG IIISLICLAI 360 

CIFTFWFFSE IQSTRTTIHK NLCCSLFLAE LVFLVGINTN TNKLFCSIIA GLLHYFFLAA 420 

FAWMCIEGIH LYLIWGVIY NKGFLHKNFY IFGYLSPAW VGFSAALGYR YYGTTKVCWL 480 

STENNFIWSF IGPACIillLV NLLAFGVIIY KVFRHTAGLK PEVSCFENIR SCARGALALL 540 

FLLGTTWIFG VLHWHASW TAYLFTVSNA FQGMFIFLFL CVLSRKIQEE YYRLFKNVPC 600 
CFGCLR 



Seq ID NO: 160 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-216 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 I I I I I 

TGTCTGCTTA TGCGGTGGCT CGCTGCTCAG AACAGGATGG CAGAGATGAG CACCACCATC 60 

AAAAACTCAA GGACCAGTGC TGTGGGTCCA GTCATCTGTT TCATGGAATT CACCAGTCTG 120 

GTATCTTCAA AATCCAGAAG GATGATGGCA GATGGCAGGA AGGAGGAAGA GGGTAATCTG 180 

GAAGAGTTTC CTGACCTACT CTGCTGCTGT GAT TAAA CAA CCACCAGGAA ATTTTGATGA 240 

CACTGTTCTC CTGAGCTCCT CCCTTTCCTC GGGGAAGAAA AGCATTGAAA CTACAAAAAT 300 

AAAGTGTTAT TTGGCTGGAG TGAGGTCTCA TGTCTGCTTA TGCGGTGGCT CGCTGCTCAG 360 

AACAGGGAAC CATTGGAGAT ACTCATTACT CTTTGAAGGC TTACAGTGGA ATGAATTCAA 420 

ATACGACTTA TTTGAGGAAT TGAAGTTGAC TTTATGGAGC TGATAAGAAT CTTCTTGGAG 480 

AAAAAAAGAC TGGTACTTCT GAATTAACCA AAATCACAGT ATTCTGAAGA TGATTCTACA 540 

AAGCCTGCTG TTTCTACAAA GGCTGCTGAT GATTTCTACA AAGCCTGCTG TAGTGTTGCT. 600 

GTGGCCTCTG CTTAAAAAAG TAGAAAACAC ATTGATGCAG CATGTTCACC CCAACCTCCC 660 

TGCCTAAAGG CTCAGGGACC ATCTTGGAAG AGGAAGGCGC GTGAGATTGT AAGAGCCGAA 720 

TTAGGGGGAT GGAGTGTGGA GAATAAGGAC ACTTCATCTT GGATGCTCAC CTGCCAAATT 780 

GACTTCTGAT GAAAGCCAGC TCCAGAAATG TGCCTACAGT TACTACTTTC ACCTAAACCC 840 

TGCCCTTAGT CAAATCCTTC TCTTCTTCTA AGCAATCAAC TTCAATTCCT TGTATAACCC 900 

ACAGTATAAA AGGGCTTTTA TACCATTCTA TCCTATTGCA TGTAAGCCTT GGGTCTGGGA 960 

GGTAACAGTG TGGGATTCCA CCATCTCATC TCCCTGCCAC CCAAACATGC CTGCTCTTCT 1020 
TTAAGCAATA TTAAATGTTT GTACTTCA 



Seq ID No: 161 Protein sequence; 
Protein Accession #: none found 



1 .11 21 31 41 51 

I I I 1 I I 

CLtLMRWLiAAQ NRMAEMSTTI KNSRTSAVGP VICFMEFTSL VSSKSRRMMA DGRKEEEGNL 60 
EEFPDLLCCC D 



Seq ID NO: 162 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-159 (underlined sequences correspond to start and stop codons) 



1 11 21 31 .41 51 

I I I I I I 

GAGACCCTCC AGAGGCAGGG CCCAGGATTG AAGAGGGAAG CCCTGCTCCA CACGTGTTCA 60 

TCAGGAAGGA CCCACAGACT GCTGCTCCTG GAGGCCTCTC GGTTTATGGA TGTGTGTTTG 120 

TTCCATAAAC CCTCAGAGGG TCACCTGGAG ACCCGC TAAA ATGCAGGTTC TTGGGCCACA 180 

TCCTAGACCT TCTGACCGAC CCAGGGAGTG GGGCCCAGGA AGCTGCATTT GACAGATATC 240 

CCCGTGTGAT CATCATGCAC ACAGGAGTGA GAGAACCAGT GTTCTCCCCG GGCAGAAGGG 300 

AAGCTCGTGT GCAGGACACC TCACACCTCC TTTCCCATTC CCCTGCCAGG CTCTCCCTGC 360 

TGACATTGTT TTTGCGGGAG AGCTGTGAAT TCTGAAGATT AGGTTGCTTC TCACCCCAAG 420 
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CTCCAGAAGT CCAGGCTGAG CCAAACCAAG CTTCAAGTTG TGCCTGGACT TGGAGAACCA 4 BO 

GGAGGTGAGG GGACTGACTA CTTGAAGATC ACATGGAGGA GGAGTCTGAT CCAGGCCCAG 540 

GCACCAAGGA AAGGCCATGC AAGGACACAG GGAGAAGGGC AGCTGTCTGT AAGCCAGAAA 600 

GAGCCTTCAC TAGAAACCAA ATCAGCCAGA ACCTTCATCT TGGACTTTCC AGCCTTCAGA 660 
GATGTGAAAA AATAAATTTC TGTTGATTAA CCTAAAAAA 

Seq ID No: 163 Protein sequence:, 
Protein Accession #: none found 



1 11 21 31 41 51 

I I I I I I 

ETLQRQGPGL KREALLHTCS SGRTHRLLLL EASRFMDVCL FHKPSEGHLE TR 

Seq ID NO: 164 DNA sequence 

Nucleic Acid Accession #: NM_020241.1 

Coding sequence: 4-1557 (underlined sequences correspond to atart and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GCCATGCAGA CCCCGCGAGC GTCCCCTCCC CGCCCGGCCC TCCTGCTTCT GCTGCTGCTA 60 

CTGGGGGG.CG CCCACGGCCT CTTTCCTGAG GAGCCGCCGC CGCTTAGCGT GGCCCCCAGG 120 

GACTACCTGA ACCACTATCC CGTGTTTGTG GGCAGCGGGC CCGGACGCCT GACCCCCGCA 180 

GAAGGTGCTG ACGACCTCAA CATCCAGCGA GTCCTGCGGG TCAACAGGAC GCTGTTCATT 240 

GGGGACAGGG ACAACCTCTA CCGCGTAGAG TTGGAGCCCC CCACGTCCAC GGAGCTGCGG 300 

TACCAGAGGA AGCTGACCTG GAGATCTAAC CCCAGCGACA TAAACGTGTG TCGGATGAAG 360 

GGCAAACAGG AGGGCGAGTG TCGAAACTTC GTAAAGGTGC TGCTCCTTCG GGACGAGTCC 420 

ACGCTCTTTG TGTGCGGTTC CAACGCCTTC AACCCGGTGT GCGCCAACTA CAGCATAGAC 480 

ACCCTGCAGC CCGTCGGAGA CAACATCAGC GGTATGGCCC GCTGCCCGTA CGACCCCAAG 540 

CACGCCAATG TTGCCCTCTT CTCTGACGGG ATGCTCTTCA CAGCTACTGT TACCGACTTC 600 

CTAGCCATTG ATGCTGTCAT CTACCGCAGC CTCGGGGACA GGCCCACCCT GCGCACCGTG 660 

AAACATGACT CCAAGTGGTT CAAAGAGCCT TACTTTGTCC ATGCGGTGGA GTGGGGCAGC 720 

CATGTCTACT TCTTCTTCCG GGAGATTGCG ATGGAGTTTA ACTACCTGGA GAAGGTGGTG 780 

GTGTCCCGCG TGGCCCGAGT GTGCAAGAAC GACGTGGGAG GCTCCCCCCG CGTGCTGGAG 840 

AAGCAGTGGA CGTCCTTCCT GAAGGCGCGG CTCAACTGCT CTGTACCCGG AGACTCCCAT 900 

TTCTACTTCA ACGTGCTGCA GGCTGTCACG GGCGTGGTCA GCCTCGGGGG CCGGCCCGTG 960 

GTCCTGGCCG TTTTTTCCAC GCCCAGCAAC AGCATCCCTG GCTCGGCTGT CTGCGCCTTT 1020 

GACCTGACAC AGGTGGCAGC . TGTGTTTGAA GGCCGCTTCC GAGAGCAGAA GTCCCCCGAG 1080 

TCCATCTGGA CGCCGGTGCC GGAGGATCAG GTGCCTCGAC CCCGGCCCGG GTGCTGCGCA 114 0 

GCCCCCGGGA TGCAGTACAA TGCCTCCAGC GCCTTGCCGG ATGACATCCT CAACTTTGTC 1200 

AAGACCCACC CTCTGATGGA CGAAGCGGTG CCCTCGCTGG GCCATGCGCC CTGGATCCTG 1260 

CGGACCCTGA TGAGGCACCA GCTGACTCGA GTGGCTGTGG ACGTGGGAGC CGGCCCCTGG 1320 

GGCAACCAGA CCGTTGTCTT CCTGGGTTCT GAGGCGGGGA CGGTCCTCAA GTTCCTCGTC 1380 

CGGCCCAATG CCAGCACCTC AGGGACGTCT GGGCGTGTGT GTCAAGTGGG CCACGCGTGC 1440 

AGGGTGTGTG TCCACGAGCG ACGATCGTGG TGGCCCCAGC GGCCTGGGCG TTGGCTGAGC 1500 

CGACGCTGGG GCTTCCAGAA GGCCCGGGGG CCTCCGAGGT GCCGGTTAGG AGTTTGAACC 1560 

CCCCCCACTC TGCAGAGGGA AGCGGGGACA ATGCCGGGGT TTCAGGCAGG AGACACGAGG 1620 

AGGGCCTGCC CGGAAGTCAC ATCGGCAGCA GCTGTCTAAA GGGCTTGGGG GCCTGGGGGG 1680 

CGGCGAAGGT GGGTGGGGCC CCTCTGTAAA TACGGCCCCA GGGTGGTGAG AGAGTCCCAT 1740 

GCCACCCGTC CCCTTGTGAC CTCCCCCCTC TGACCTCCAG CTGACCATGC ATGCCACGTG 1800 
G 

Seq ID No: 165 Protein sequence: 
Protein Accession #: NP_064626.1 

1 11 21 31 41 51 

I I I I I I 

MQTPRASPPR PALLLLLIaLL GGAHGLFPEE PPPLSVAPRD YLNHYPVFVG SGPGRLTPAE 60 

GADDLNTQRV LRVNRTLFIG DRDNLYRVEL EPPTSTBLRY QRKLTWRSNP SDINVCRMKG 120 

KQEGECRNFV KVIiLLRDEST LFVCGSNAFN PVCANYSIDT LQPVGDNISG MARCPYDPKH 180 

ANVALFSDGM LFTATVTDFL AIDAVIYRSL GDRPTLRTVK HDSKWFKEPY FVHAVEWGSH 240 

VYFFFREIAM EFNYLEKWV SRVARVCKND VGGSPRVLEK QWTSFLKARL NCSVPGDSHF 300 

YFNVLQAVTG WSLGGRPW LAVFSTPSNS IPGSAVCAFD LTOVAAVFEG RFREQKSPES 360 

IWTPVPEDQV PRPRPGCCAA PGMQYNASSA LPDDILNFVK THPLMDEAVP SIjGHAPWILR 420 

TLMRHQLTRV AVDVGAGPWG NQTWFLGSE AGTVLKFLVR PNASTSGTSG RVCQVGHACR 480 
VCVHERRSWW PQRPGRWLSR RWGFQKARGP PRCRLGV 

Seq ID NO: 166 DNA sequence 

Nucleic Acid Accession #: NM_032108.1 

Coding sequence: 39-2705 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

TCCGAGGCGT CACCTCCTCC TGTCGCCTGG CCCTCGC CAT G CAGACCCCG CGAGCGTCCC 60 

CTCCCCGCCC GGCCCTGCTG CTTCTGCTGC TGCTACTGGG GGGCGCCCAC GGCCTCTTTC 120 
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CTGAGGACCC GCCGCCGCTT AGCGTGGCCC CCAGGGACTA CCTGAACCAC TATCCCGTGT 180 

TTGTGGGCAG CGGGCCCGGA CGCCTGACCC CCGCAGAAGG TGCTGACGAC CTCAACATCC 240 

AGCGAGTCCT GCGGGTCAAC AGGACGCTGT TCATTGGGGA CAGGGACAAC CTCTACCGCG 300 

TAGAGCTGGA GCCCCCCACG TCCACGGAGC TGCGGTACCA GAGGAAGCTG ACCTGGAGAT 360 

CTAACCCCAG CGACATAAAC GTGTGTCGGA TGAAGGGCAA ACAGGAGGGC GAGTGTCGAA 420 

ACTTCGTAAA GGTGCTGCTC CTTCGGGACG AGTCCACGCT CTTTGTGTGC GGTTCCAACG 480 

CCTTCAACCC GGTGTGCGCC AACTACAGCA TAGACACCCT GCAGCCCGTC GGAGACAACA 540 

TCAGCGGTAT GGCCCGCTGC CCGTACGACC CCAAGCACGC CAATGTTGCC CTCTTCTCTG 600 

ACGGGATGCT CTTCACAGCT ACTGTTACCG ACTTCCTAGC CATTGATGCT GTCATCTACC 660 

GCAGCCTCGG GGACAGGCCC ACCCTGCGCA CCGTGAAACA TGACTCCAAG TGGTTCAAAG 720 

AGCCTTACTT TGTCCATGCG GTGGAGTGGG GCAGCCATGT CTACTTCTTC TTCCGGGAGA 780 

TTGCGATGGA GTTTAACTAC CTGGAGAAGG TGGTGGTGTC CCGCGTGGCC CGAGTGTGCA 840 

AGAACGACGT GGGAGGCTCC CCCCGCGTGC TGGAGAAGCA GTGGACGTCC TTCCTGAAGG 900 

CGCGGCTCAA CTGCTCTGTA CCCGGAGACT CCCATTTCTA CTTCAACGTG CTGCAGGCTG 960 

TCACGGGCGT GGTCAGCCTC GGGGGCCGGC CCGTGGTCCT GGCCGTTTTT TCCACGCCCA 1020 

GCAACAGCAT CCCTGGCTCG GCTGTCTGCG CCTTTGACCT GACACAGGTG GCAGCTGTGT 1080 

TTGAAGGCCG CTTCCGAGAG CAGAAGTCCC CCGAGTCCAT CTGGACGCCG GTGCCGGAGG 1140 

ATCAGGTGCC TCGACCCCGG CCCGGGTGCT GCGCAGCCCC CGGGATGCAG TACAATGCCT 1200 

CCAGCX3CCTT GCCGGATGAC ATCCTCAACT TTGTCAAGAC CCACCCTCTG ATGGACGAGG 1260 

CGGTGCCCTC GCTGGGCCAT GCGCCCTGGA TCCTGCGGAC CCTGATGAGG CACCAGCTGA 1320 

CTCGAGTGGC TGTGGACGTG GGAGCCGGCC CCTGGGGCAA CCAGACCGTT GTCTTCCTGG 1380 

GTTCTGAGGC GGGGACGGTC CTCAAGTTCC TCGTCCGGCC CAATGCCAGC ACCTCAGGGA 1440 

CGTCTGGGCT CAGTGTCTTC CTGGAGGAGT TTGAGACCTA CCGGCCGGAC AGGTGTGGAC 1500 

GGCCCGGCGG TGGCGAGACA GGGCAGCGGC TGCTGAGCTT GGAGCTGGAC GCAGCTTCGG 1560 

GGGGCCTGCT GGCTGCCTTC CCCCGCTGCG TGGTCCGAGT GCCTGTGGCT CGCTGCCAGC 1620 

AGTACTCGGG GTGTATGAAG AACTGTATCG GCAGTCAGGA CCCCTACTGC GGGTGGGCCC 1680 

CCGACGGCTC CTGCATCTTC CTCAGCCCGG GCACCAGAGC CGCCTTTGAG CAGGACGTGT 1740 

CCGGGGCCAG CACCTCAGGC TTAGGGGACT GCACAGGACT CCTGCGGGCC AGCCTCTCCG 1800 

AGGACCGCGC GGGGCTGGTG TCGGTGAACC TGCTGGTAAC GTCGTCGGTG GCGGCCTTCG 1860 

TGGTGGGAGC CGTGGTGTCC GGCTTCAGCG TGGGCTGGTT CGTGGGCCTC CGTGAGCGGC 1920 

GGGAGCTGGC CCGGCGCAAG GACAAGGAGG CCATCCTGGC GCACGGGGCG GGCGAGGCGG 1980 

TGCTGAGCGT CAGCCGCCTG GGCGAGCGCA GGGCGCAGGG TCCCGGGGGC CGGGGCGGAG 2040 

GCGGTGGCGG TGGCGCCGGG GTTCCCCCGG AGGCCCTGCT GGCGCCCCTG ATGCAGAACG 2100 

GCTGGGCCAA GGCCACGCTG CTGCAGGGCG GGCCCCACGA CCTGGACTCG GGGCTGCTGC 2160 

CCACGCCCGA GCAGACGCCG CTGCCGCAGA AGCGCCTGCC CACTCCGCAC CCGCACCCCC 2220 

ACGCCCTGGG CCCCCGCGCC TGGGACCACG GCCACCCCCT GCTCCCGGCC TCCGCTTCAT 2280 

CCTCCCTCCT GCTGCTGGCG CCCGCCCGGG CCCCCGAGCA GCCCCCCGCG CCTGGGGAGC 2340 

CGACCCCCGA CGGCCGCCTC TATGCTGCCC GGCCCGGCCG CGCCTCCCAC GGCGACTTCC 2400 

CGCTCACCCC CCACGCCAGC CCGGACCGCC GGCGGGTGGT GTCCGCGCCC ACGGGCCCCT 2460 

TGGACCCAGC CTCAGCCGCC GATGGCCTCC CGCGGCCCTG GAGCCCGCCC CCGACGGGCA 2520 

GCCTGAGGAG GCCACTGGGC CCCCACGCCC CTCCGGCCGC CACCCTGCGC CGCACCCACA 2580 

CGTTCAACAG CGGCGAGGCC CGGCCTGGGG ACCGCCACCG CGGCTGCCAC GCCCGGCCGG 2640 

GCACAGACTT GGCCCACCTC CTCCCCTATG GGGGGGCGGA CAGGACTGCG CCCCCCGTGC 2700 

CCTAGGCCGG GGGCCCCCCG ATGCCTTGGC AGTGCCAGCC ACGGGAACCA GGAGCGAGAG 2760 

ACGGTGCCAG AACGCCGGGG CCCGGGGCAA CTCCGAGTGG GTGCTCAAGT GCCCCCCGCG 2820 

ACCCACCCGC GGAGTGGGGG GCCCCCTCCG CCACAAGGAA GCACAACCAG CTCGCCCTCC 2880 

CCCTACCCGG GGCCGCAGGA CGCTGAGACG GTTTGGGGGT GGGTGGGCGG GAGGACTTTG 2940 

CTATGGATTT GAGGTTGACC TTATGCGCGT AGGTTTTGGT TTTTTTTGCA GTTTTGGTTT 3000 

CTTTTGCGGT TTTCTAACCA ATTGCACAAC TCCGTTCTCG GGGTGGCGGC AGGCAGGGGA 3060 

GGCTTGGACG CCGGTGGGGA ATGGGGGGCC ACAGCTGCAG ACCTAAGCCC TCCCCCACCC 3120 

CTGGAAAGGT CCCTCCCCAA CCCAGGCCCC TGGCGTGTGT GGGTGTGCGT GCGTGTGCGT 3180 

GCCGTGTTCG TGTGCAAGGG GCCGGGGAGG TGGGCGTGTG TGTGCGTGCC AGCGAAGGCT " 3240 

GCTGTGGGCG TGTGTGTCAA GTGGGCCACG CGTGCAGGGT GTGTGTCCAC GAGCGACGAT 3300 

CGTGGTGGCC CCAGCGGCCT GGGCGTTGGC TGAGCCGACG CTGGGGCTTC CAGAAGGCCC 3360 

GGGGGTCTCC GAGGTGCCGG TTAGGAGTTT GAACCCCCCC CACTCTGCAG AGGGAAGCGG 3420 

GGACAATGCC GGGGTTTCAG GCAGGAGACA CGAGGAGGGC CTGCCCGGAA GTCACATCGG 3480 
CAGCAGCTGT CTAAAGGGCT TGGGGGCCTG GGGGGCGGCG AAAG 



Seg ID No: 167 Protein sequence t 
Protein Accession #: NP_115484.1 

1 11 21 31 41 51 

I I I I I I 

MQTPRASPPR PALLLLLLLL GGAHGLFPED PPPLSVAPRD YLNHYPVFVG SGPGRLTPAE 60 

GADDLKIQRV LRVNRTLFIG DRDNLYRVEL EPPTSTELRY QRKLTWRSNP SDINVCRMKG 120 

KQEGECRNFV KVLLLRDEST LFVCGSNAFN PVCANYSIDT LQPVGDNISG MARCPYDPKH 180 

ANVALFSDGM LFTATVTDFL AIDAVIYRSL GDRPTLRTVK HDSKWFKEPY FVHAVEWGSH 240 

VYFFFRE1AM EFNYLEKVW SRVARVCKND VGGSPRVLEK QWTSFLKARL NCSVPGDSHF 300 

YFNVLQAVTG WSLGGRPW LAVFSTPSNS IPGSAVCAFD LTQVAAVFEG RFREQKSPES 360 

IWTPVPEDQV PRPRPGCCAA PGMQYNASSA LPDDILNFVK THPLMDEAVP SLGHAPWILR 420 

TLMRHQLTRV AVDVGAGPWG NQTWFLGSE AGTVLKFLVR PNASTSGTSG LSVFLEEFET 480 

YRPDRCGRPG GGETGQRLLS LELDAASGGL LAAFPRCWR VPVARCQQYS GCMKNCIGSQ 540 

DPYCGWAPDG SCIFLSPGTR AAFEQDVSGA STSGLGDCTG LLRASLSEDR AGLVSVNLLV 600 

TSSVAAFWG AWSGFSVGW FVGLRERREL ARRKDKEAIL AHGAGEAVLS VSRLGERRAQ 660 

GPGGRGGGGG GGAGVPPEAL LAPLMQNGWA KATLLCjGGPH DLDSGLLPTP EQTPLPQKRL 720 
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PTPHPHPHAL GPRAWDHGHP LLPASASSSL LLLAPARAPE QPPAPGEPTP DGRLYAARPG 780 
RASHGDFPLT PHASPDRRRV VSAPTGPLDP ASAADGLPRP WSPPPTGSLR RPLGPHAPPA 840 
ATIiRRTHTPN SGEARPGDRH RGCHARPGTD LAHLLPYGGA DRTAPPVP 



10 



Seq ID NO: 168 DNA sequence 
Nucleic Acid Accession #: AW205664 

Coding sequence: 1-135 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



15 



20 



CGGCACGAGG 
CTAATTTGGA 
ACCCTCTCCG 
CTTGGAAGGC 
GTAAGCCTGA 
TGCATGGCGT 
TAAAACAGCT 



AGAACAGGGG 
GCACAGTCTT 
TT TAG TACCT 
GGAATGTGTT 
GTGGATCCTG 
TATGTAGATC 
GCCCTGGATG 



CCTCTGCCTC 
CCCGGTGCCT 
GACCACCTGT 
TTCGTGTCTT 
ACTCAGCTGC 
ACGTGCGGCA 
AAACGGAATA 



AGTTTGCCCG 
AGACATGCCA 
TTCAAAACGC 
CTAGGAAGGG 
AGCCCTTACC 
GAGACAGCCA 
AACCAGTGAT 



GGAGCCAGCC 
AGGCCCCTCC 
AGGTGTTTCT 
TCTGCTGAGG 
TGCCTCGTGC 
CTGTCCTGTG 
GCTAAAAAAA 



AGGGCCCATC 
CACGTGGTAC 
GGTTTAGAAA 
ACCAGACCAC 
TGATGATCTA 
TGCGGGTTTT 
AAAAAAAAAA 



Seq ID No: 169 Protein sequence: 
Protein Accession #: AW205664 



60 
120 
180 
240 
300 
360 



25 1 11 • 21 31 41 

I I I I I 

RKEENRGLCL SLPGSQPGPI LIWSTVFPVP RHAKAPPTWY TLSV 
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30 Seq ID NO: 170 DNA sequence 

Nucleic Acid Accession #: AB033100 

Coding sequence: 32-2623 (underlined sequences correspond to start and stop codons) 



35 



40 



45 



50 



55 



60 



65 



70 



75 



AGGTCTGGGG 
GACGGTCTCG 
GCACTCCGTC 
CATCATCCCC 
GATCCATGAT 
TGAGCACTAC 
GGATGTCACT 
CCGGCAGGTG 
CAGGCGGGTC 
GCGGGAGGAA 
AGACAAGCAG 
CCTGGAGCTG 
CCATGTGTAC 
TGAGGACGAC 
CTACAGGTAC 
CGCCTTTGTC 
GCCTCCCCCA 
GGTCCTGGGC 
CCCCACGCAG 
CATGGTGCCC 
CGAGTTGCAT 
ACCGGAGAGC 
GAGCCTGGAG 
GCTGGCCTTT 
GCCCGTGACG 
CCTACGGGAG 
GGCCAACTTC 
GGCCCTGGGG 
CTGGGTGAGC 
GTGGCCTGGG 
CCATCTAAGC 
CCTTACCATG 
CCGCATCCCC 
GGCCCTGCGG 
CGGCCAGGGC 
AGGCTTCCCC 
GGGTGAATTT 
GAAGGAGGTG 
CCTGCGGGAG 
AATGCGGAGG 



11 
I 

TCCTGAGGCT 
GCAGGCACCC 
AGCATCCACT 
AACAAGGTGG 
GAGCTGCTCA 
CTGGTGCAAG 
GAGAAGATGG 
CAGGGTGGGC 
CTCCAGAAAC 
MCTGTGCTTT 
AACCTTCATG 
GCCATCCGGA 
CATAACACCG 
TTGCATGTGA 
CACCGCCTGC 
AGTGTTCTCC 
GCCCTCGTCT 
ACCCTCATCC 
GCCAAGCCCC 
CAGGGAAGGA 
GACCTGAAAG 
CCAGCCCAGG 
CGATACTTCT 
GCCCTCAGTT 
CTGAGCTCAG 
GACGATCTGG 
CGGCGGGTGC 
AGCATCCTGG 
CTTCGGGAGG 
CCCCCTGTGG 
GAGCCTCCCC 
CAGGAGGTCT 
ATGCCGGACT 
GCCGCCCTCT 
CGTACCACAA 
GAGGTGGGTG 
CAGGTAGTAA 
GACGCAGCGC 
ATCATCATCT 
CTGCAGCTGC 



21 
I 

GCTGGCAGAC 
CATTTGAGGG 
CCTTCCAGAG 
CCCCTGTTGT 
AGGCTCATTA 
GAGCTCAGGC 
ATGTGCTGGG 
TCACTGTGTT 
TCCAGAAGGA 
TCCTGCGTGC 
AGAACCTCCA 
AAGAGATCCA 
AGGACCTGTG 
CGGAGGAGGT 
CCCTGCCCGA 
GGGAGACCCC 
TCAGCTGCCA 
TGCTTCACCG 
TGCCTATGGA 
GGATGGTGGA 
AAGTGGTCTT 
GAAGCGGCAG 
ACCTGATCCT 
TCAGCCGCTG 
CAGGCCCTGT 
TCTCCCCGGA 
CCCGCATGCC 
CCTACCTGAC 
AGGCCGTGTT 
CTCCTGACCA 
CAGGCAAGGA 
TCAGCCAGCA 
TCTGTGCCCC 
CCAAGGACCC 
CTGCGATGGT 
AGGAGGAGCT 
TGAAGGTGGT 
TGGACACTGT 
GCACCTACCG 
GGAGCCTGCA 



31 
I 

TATGG GTACA 
CCTACAGGGC 
CACTAGCTTG 
GATCACGTAC 
CACGTTGGGC 
CTTACCCCAG 
CACCGTGGGA 
CGGCATGGGA 
CGGACATAGG 
AGATGAGGAC 
GGGCCTTGGA 
CGACTTTGCC 
GGGGGAGCCC 
GTACAAGCGG 
GCAAGGGAGT 
CAGCCTGCTG 
GATGGGCGTG 
CAGTGGGACC 
GCAGTTCCAG 
AGAGGTGGAC 
GGAAAACCAG 
CCGACACAGC 
GTTTAACTAC 
GCTGTGTGCC 
GGCTCCGAGG 
CGCGCTCAGC 
CATCTACGGC 
GGACGCCAAG 
GGAGTGTGAC 
GCTGGAGACC 
GGGCCCCCTG 
CCGCAGGGCC 
CCGAGAGGAG 
AGGCACTGGC 
GGTGGCTGTC 
CGTGAGTGTG 
GCAGCTGCTA 
CAGCGAGACC 
CCAGGCGAAG 
GTACTTGGAG 



41 
I 

ACGGCCAGCA 
AGTGGCACGA 
CATAACAGCA 
AACTGCAAGG 
CGGCTCTCGG 
GGCCGCTACT 
AGCTGTGGGG 
CAGCCCAGCC 
GAGTGTGTCA 
TTTGTGTCCT 
CCCGGGGTCC 
CAGCTGAGCG 
CATGCTGTGG 
CCCCTCTTCC 
CCCCTGGAGG 
CAGCTCCGTG 
GGCAGGACCA 
ACCTCCCAGC 
GTGATCCAGA 
AGAGCCATCA 
AAGAAGTTAG 
GTCTGGCAGA 
TACCTTCATG 
CACCCTGAGC 
GACCTCATCG 
ACTGTCAGAG 
ACGGCCCAGC 
AGGAGGCTGC 
GGGCACACCT 
CTGGAGGCCC 
ACCTACAGGT 
TGTCCTGGCC 
GACTTTGACC 
TTCGTGTTCA 
CTGGCCTTCT 
CCTGATGCCA 
CCCGATGGGC 
ATGACGCCCA 
GCAGCGAAAG 
CGCTATGTCT 



51 
I 

CAGCCCAGCA 
TGGACAGTCG 
AGGCCAAGTC 
AGGAGTTCCA 
ACAACACCCC 
TCCTGGTGCG 
CCCCCAACTT 
TCTTAGGGTT 
TCTTCTGTGT 
ACACACCTCG 
GGGTGGAGAG 
AGAACACATA 
CCATCCATGG 
TGCAGCCCAC 
CCCAGTTGGA 
ATGCCCACGG 
ACCTGGGCAT 
CAGAGGCTGC 
GCTTTCTCCG 
CTGCCTGTGC 
AAGGTATCCG 
GGGCGCTGTG 
AGCAGTACCC 
TGTACCGCCT 
CCAGGGGCTC 
AGATGGATGT 
CCAGCGCCAA 
GGAAGGTTGT 
ACAGCCTGCG 
AGCTGAAGGC 
TCCAGACCTG 
TCACCTACCA 
AGCTGCTGGA 
GCTGCCTCAG 
GGCACATCCA 
AGTTCACTAA 
ACCGTGTGAA 
TGCACTACCA 
AGGCGCAGGA 
GCCTGATTCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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1020 
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1140 
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1260 
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1800 
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2280 
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2400 
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CTTCAACGCG 
GATGCAGGAG 
CGAGCTGGAG 
GAGCTGCAGC 
CCCCCCACCC 
GGCCCTGAGG 
TTGGGAGCCT 
CAAACCACCA 
GTGCACACTG 
CCCCCAGTTG 
TGGCAGCCCC 
CTGCTCTGCC 
GCCTGAGGTG 
ACCCAGTTTT 
CCTACCCAGC 
AGGGAGCCCC 
AGGCCTGCTC 
GCTGCAGGTC 
CCTGCTGCAG 
TGGACAGTGA 
GGCCATCCTT 
CTGCTGAGTT 
CAGGTGGCAG 
GTGCTTGCTG 
GGGACCAGSA 
GATGTGTCAT 
CAGTATTGGA 
GTTCACTACC 
CTTTGCAGAC 
GTGTTCTCAT 
CTCTCTGCCT 
CRGGGGTGTT 
GCATTAAAAA 



TACCTCCACC 
GTGGCATCGA 
AGCGGGGAGG 
CTCGAGCCCT 
ACAGGGCCCC 
GGTGCTGGCC 
TTTTAGAAAG 
AGGTGTGTGG 
CTGTGTGTAC 
CCAAACACTG 
TGGCACAGAG 
ATTGCCGCTC 
GGTGGAGGGG 
CTGGACTCTC 
CTGGTGGGGC 
TCTCATGGGG 
CACTTGTCTG 
CCCCGGCATC 
CAGGAGCCCC 
GGTGTGCAAG 
GCTGAGCATC 
AGAGGCTGCT 
AGAAGTGCCA 
AAACCCAGGA 
AAGCCTGTCT 
TGGTCATGAT 
AAATATTTGA 
TTTTCAGGTT 
AAGGTCTAGA 
GGTTGGCTCT 
CAGTTTCCCC 
GTGAGGATTC 
CAGCTAAATG 



TGGAGAAGGC 
AGGCTGGCAT 
ACCAGCCCTT 
CTGCCCCCGA 
ACGCAGGCCT 
TTGAAATGAT 
AACTTTTTAT 
CTGACCTCCA 
CTTGCAGACA 
TGGATCTCTC 
CAGACCCGGC 
CCCTTCTTGC 
ACAGTGTTCT 
ATGCCCCCAT 
TGGCAGGATG 
AGGAAAGAGC 
GGAACCTGGG 
TCTCTCTGTC 
AAGGAGTGCT 
GGTGCACTGA 
TTTGAGCCTG 
GGGATCCACT 
TGTTTGCGTT 
GCTGAACAGT 
TTGGTTAGGC 
ATTTGAAAAG 
CCCCCTTGGC 
TATTGTTTTT 
TGCGGAGTCA 
GACTTTCAGC 
ATCTGTAAAA 
ATTTGTGATT 
TG 



CGACTCCTGG 
CTACGAGATC 
CTCCAGGCTG 
GGACTTGCTG 
GGGGTGTCTG 
TCCCCCACTT 
AGGACAGGGA 
GGGAGGAGCA 
GGCCGGCGTT 
TGTCCTCTTC 
CACTGGTAGC 
TGCCCAAGCA 
GGATAGATCT 
CTCCGACCTG 
GTGGAGGTTT 
TTCCAGGGGG 
CAGGAGGCAC 
CCGGCAGCCC 
AGCTGAGGGT 
GGGTGGTGGG 
CCTTCCGGTG 
GTTTCCACAC 
GAGCCTTGCA 
GAGGAGGCTG 
TCGTGTACTT 
GGGAGGAGGC 
TGAATTCTTT 
ATTTTTGCAT 
GAGATGGGAC 
TGTGTTGGGA 
TGGGAGAATA 
TTTTTTTTTT 



CAGAGGCCCT 
CTTAACGAGC 
CGCTACCGGT 
TAGGGGGCCT 
AGGTGCTCTT 
CCTGGAGAGA 
GACAGCACAG 
CTCACTGGAG 
CAGCCTCCAA 
TCCCCTCTCT 
TCCCCACTTC 
CTGCCCTCGG 
ATTATGTGAA 
GGAGACTTCA 
CTCAAGGAGC 
CGAACGCAGC 
AGAGGAAGCC 
AGGATGGCCT 
GGTTGCTGGG 
AGGGGATCAC 
GGAGCAGAAA 
AGCGGGAAGG 
GCTCTTCCAG 
TCCACCTTGC 
CTGCAGGAAA 
CGAAGTTGTT 
TGCAGAACTA 
GAATTAAGAC 
TGAATGGGGA 
CCACTGGCTG 
ATACTTGCCT 
TTTTTGTACA 



TCAGCACCTG 
TGGGCTTCCC 
GGCAGGAGCA 
TACTCCCTGT 
GGCTGGGAGC 
CTGAGCGGAG 
CCATCCCTTG 
TGCTCACAAG 
GGGGCTCACT 
CAGATTGGCC 
CTTACTCCTG 
GCGTCTGGCA 
AGGCAGCTTC 
GGAATGACAA 
TGGAGACTTC 
ACAGAGGAAG 
AAGGCCTGGA 
GGTGCCCCCA 
GTGGTCCTCA 
CTGGGTTCCA 
AGGCCAGACC 
CTGCTGGGAA 
CTGGGGACTG 
TTGGCTCACT 
AAAAAAAAAG 
CCCATTTATC 
CTGTGTGTCT 
GTTTTAATTT 
GGGATCCTTT 
ATCACATCAC 
ACCTACCTCA 
GAGCTTTTAA 



2460 
2520 
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2820 
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Seq ID No: 171 Protein sequence ; 
Protein Accession #: BAA86588.1 



MGTTASTAQQ 
ITYNCKEEFQ 
TVGSCGAPNF 
EDFVSYTPRD 
EPHAVAIHGE 
LLQLRDAHGP 
FQVIQSFLRM 
HSVWQRALWS 
PRDLIARGSL 
AKRRLRKWW 
PLTYRFQTCL 
TGFVFSCLSG 
LLPDGHRVKK 
LERYVCLILF 
RLRYRWQEQS 



11 
I 

TVSAGTPFEG 
IHDELLKAHY 
RQVQGGLTVF 
KQNIiHENLQG 
DDLHVTEEVY 
PPALVFSCQM 
VPQGRRMVEE 
LERYFYLILF 
REDDLVSPDA 
VSLREEAVLE 
TMQEVFSQHR 
QGRTTTAMW 
EVDAALDTVS 
NAYLHIiEKAD 
CSLEPSAPED 



21 

I 

LQGSGTMDSR 
TLGRLSDNTP 
GMGQPSLLGF 
LGPGVRVESL 
KRPLFLQPTY 
GVGRTNLGMV 
VDRAITACAE 
NYYLHEQYPL 
LSTVREMDVA 
CDGHTYSLRW 
RACPGLTYHR 
AVLAFWHIQG 
ETMTPMHYHL 
SWQRPFSTWM 
LL 



31 
I 

HSVSIHSFQS 
EHYLVQGAQA 
RRVLQKLQKD 
ELAIRKEIHD 
RYHRLPLPEQ 
LGTLILLHRS 
LHDLKEWLE 
AFALSFSRWL 
NFRRVPRMPI 
PGPPVAPDQIi 
IPMPDFCAPR 
FPEVGEEELV 
REIIICTYRQ 
QEVASKAGIY 



41 
I 

TSLHNSKAKS 
LPQGRYFLVR 
GHRECVIFCV 
FAQLSENTYH 
GSPLEAQLDA 
GTTSQPEAAP 
NQKKLEGIRP 
CAHPELYRLP 
YGTAQPSAKA 
ETLEAQLKAH 
EEDFDQLLEA 
SVPDAKFTKG 
AKAAKEAQEM 
EILNELGFPE 



51 
I 

IIPNKVAPW 
DVTEKMDVLG 
REEVLFLRAD 
VYHNTEDLWG 
FVSVLRETPS 
TQAKPLPMEQ 
ESPAQGSGSR 
VTLSSAGPVA 
LGSILAYLTD 
LSEPPPGKEG 
LRAALSKDPG 
EFQWMKWQ 
RRLQLRSLQY 
LESGEDQPFS 



60 
120 
180 
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300 
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Seq ID NO: 172 DNA sequence 

Nucleic Acid Accession #: AK021806.1 

Coding sequence: 1-645 (underlined sequences correspond to start and stop codons) 



ACTG TGCTTT 
AACCTTCATG 
GCCATCCGGA 
CATAACACCG 
TTGCATGTGA 
CACCGCCTGC 
AGTGTTCTCC 
GCCCTCGTCT 
ACCCTCATCC 
GCCAAGCCCC 
CAGGGAAGGA 
AGTTTTCTGG 
CCCAGCCTGG 
AGCCCCTCTC 



11 
I 

TCCTGCGTGC 
AGAACCTCCA 
AAGAGATCCA 
AGGACCTGTG 
CGGAGGAGGT 
CCCTGCCCGA 
GGGAGACCCC 
TCAGCTGCCA 
TGCTTCACCG 
TGCCTATGGA 
GGATGGTGGA 
ACTCTCATGC 
TGGGGCTGGC 
ATGGGGAGGA 



21 

I 

AGATGAGGAC 
GGGCCTTGGA 
CGACTTTGCC 
GGGGGAGCCC 
GTACAAGCGG 
GCAAGGGAGT 
CAGCCTGCTG 
GATGGGCGTG 
CAGTGGGACC 
GCAGTTCCAG 
AGAGGTGGAT 
CCCCATCTCC 
AGGATGGTGG 
AAGAGCTTCC 



31 
I 

TTTGTGTCCT 
CCCGGGGTCC 
CAGCTGAGCG 
CATGCTGTGG 
CCCCTCTTCC 
CCCCTGGAGG 
CAGCTCCGTG 
GGCAGGACCA 
ACCTCCCAGC 
GTGATCCAGA 
AGATCTATTA 
GACCTGGGAG 
AGGTTTCTCA 
AGGGGGCGAA 



41 

I 

ACACACCTCG 
GGGTGGAGAG 
AGAACACATA 
CCATCCATGG 
TGCAGCCCAC 
CCCAGTTGGA 
ATGCCCACGG 
ACCTGGGCAT 
CAGAGGCTGC 
GCTTTCTCCG 
TGTGAAAGGC 
ACTTCAGGAA 
AGGAGCTGGA 
CGCAGCACAG 



51 
I 

AGACAAGCAG 
CCTGGAGCTG 
CCATGTGTAC 
TGAGGACGAC 
CTACAGGTAC 
CGCCTTTGTC 
GCCTCCCCCA 
GGTCCTGGGC 
CCCCACGCAG 
CATGGTGCCC 
AGCTTCACCC 
TGACAACCTA 
GACTTCAGGG 
AGGAAGAGGC 



60 
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CTGCTCCACT TGTCTGGGAA CCTGGGCAGG 
CAGGTCCCCC GGCATCTCTC TCTGTCCCGG 
CTGCAGCAGG AGCCCCAAGG AGTGCTAGCT 
CAGTGAGGTG TGCAAGGGTG CACTGAGGGT 
ATCCTTGCTG AGCATCTTTG AGCCTGCCTT 
TGAGTTAGAG GCTGCTGGGA TCCACTGTTT 
TGGCAGAGAA GTGCCATGTT TGCGTTGAGC 
TTGCTGAAAC CCAGGAGCTG AACAGTGAGG 
CCAGGAAAGC CTGTCTTTGG TTAGGCTCGT 
TGTCATTGGT CATGATATTT GAAAAGGGGA 
ATTGGAAAAT ATTTGACCCC CTTGGCTGAA 
ACTACCTTTT CAGGTTTATT GTTTTTATTT 
GCAGACAAGG TCTAGATGCG GAGTCAGAGA 
TCTCATGGTT GGCTCTGACT TTCAGCTGTG 
CTGCCTCAGT TTCCCCATCT GTAAAATGGG 
GGTGTTGTGA GGATTCATTT GTGATTTTTT 
TAAAAACAGC TAAATGTG 



AGGCACAGAG GAAGCCAAGG CCTGGAGCTG 900 

CAGCCCAGGA TGGCCTGGTG CCCCCACCTG 960 

GAGGGTGGTT GCTGGGGTGG TCCTCATGGA 1020 

GGTGGGAGGG GATCACCTGG GTTCCAGGCC 1080 

CCGGTGGGAG CAGAAAAGGC CAGACCCTGC 1140 

CCACACAGCG GGAAGGCTGC TGGGAACAGG 1200 

CTTGCAGCTC TTCCAGCTGG GGACTGGTGC 1260 

AGGCTGTCCA CCTTGCTTGG CTCACTGGGA 1320 

GTACTTCTGC AGGAAAAAAA AAAAAGGATG 1380 

GGAGGCCGAA GTTGTTCCCA TTTATCCAGT 1440 

TTCTTTTGCA GAACTACTGT GTGTCTGTTC 1500 

TTGCATGAAT TAAGACGTTT TAATTTCTTT 1560 

TGGGACTGAA TGGGGAGGGA TCCTTTGTGT 1620 

TTGGGACCAC TGGCTGATCA CATCACCTCT 1680 

AGAATAATAC TTGCCTACCT ACCTCACGGG 1740 

TTTTTTTTTT TGTACAGAGC TTTTAAGCAT 1800 



Seq ID No: 173 Protein sequence : 
Protein Accession #: AK02 1806.1 



1 11 21 31 41 51 

I I I I I I 

TVLFLRADED FVSYTPRDKQ NLHENLQGLG PGVRVESLEL AIRKEIHDFA QLSENTYHVY 60 
HNTEDLWGEP HAVAIHGEDD LHVTEEVYKR PLFLQPTYRY HRLPLPEQGS PLEAQLDAFV 120 
SVLRETPSLL QLRDAHGPPP ALVFSCQMGV GRTNLGMVLG TLILLHRSGT TSQPEAAPTQ 180 
AKPLPMEQFQ VIQSPLRMVP QGRRMVEEVD RSIM 



Seq ID- NO: 174 DNA 8 ecru ence 
.Nucleic Acid Accession #: NM_016580.2 

Coding sequence: 1212-4766 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGGAAGCGGG AGGAGAGCCA CACGGTCAAG TTGCACAGGT TCTTGCAGCT TCTGGAATCA 60 

AGACCATGGG CACCCTCATA AGTCAGTGTG GGCAGGGACT GCCCCAGGGC CAATCCAAGA 120 

TCCAGAGGTA GCCATAGGGT GTGACAAGTT GTGCAGATTA CAACACTCAC CCCTTGCAAT 180 

AACGTCACTG CCTGTGACTC GGGGCCAGGC CCAGGCCAAA GCCCTTCCTA CATCATTTCG 240 

TTTAATCCTC ACAGTTTCCT GCTGAAAGGG CTACTATTCT TACTCCCATC CCCACTCTAC 300 

AGATGAGGTA ATGGAGGCCC AGGAAAGTTA AGTGACTTGT CCCAGATGAC ACCGCTGGTA 360 

AGTTGCAAAG TCAGAATTTG AACTCAGGCA GTTTACCTCT GATGGCTGCT CTGTTAATCA 420 

CAGCTGCTTT CCAGTGAGAC AAAAACGGGT GATCAGGGCA GAGTCAAGAC AGAGAGGTAA 480 

ACAAGATTGG GAAAAAGACA GGAATGAGAG GGGAACAATG GGGGAAAAGA TAGGAACAAA 540 

GAGAGTTGGG GAAGGGGAGA GAAACAGGAA ACATGACTTG CCCGGGAGGG GCATCAGTCC 600 

ACGTGCAAGC AGGTGGAGGC TCAAGTTTTC TGCTCACTTG GTGATGCAGA GGCTCCCTTT 660 

CCCTCAGCAG CCGCCTTGCT GCGTGGACAG CAGCTTCCCA TCTGGCCTGT CCCCGGAGCC 720 

CCGGCCTCAT CCTCCTCAGC GGCAGGCCAC TTAGCTTCAC AGGAAATGCT CTTTCTCTAA 780 

TTGGCATTGA AACTCACAGC CCTCCCTTTT CCTGTAGGTG GGGTTTCCAT AGGAAAAAGC 840 

TGCTTCTCTG TTTCCCCAGC CTAGCAACTG TTTGGCAGTC AGAGTCCCAC ATCCTGCTCA 900 

ACTGGGTCAG GTCCCTCTTA GACCAGCTCT TGTCCATCAT TTGCTGAAGT GGACCAACTA 960 

GTTCCCCAGT AGGGGGTCTC CCCTGGCAAT TCTTGATCGG CGTTTGGACA TCTCAGATCG 1020 

CTTCCAATGA AGATGGCCTT GCCTTGGGGT CCTGCTTGTT TCATAATCAT CTAACTATGG 1080 

GACAAGGTTG TGCCGGCAGC TCTGGGGGAA GGAGCACGGG GCTGATCAAG CCATCCAGGA 1140 

AACACTGGAG GACTTGTCCA GCCTTGAAAG AACTCTAGTG GTTTCTGAAT CTAGCCCACT 1200 

TGGCGGTAAG C ATGA TGCAA CTTCTGCAAC TTCTGCTGGG GCTTTTGGGG CCAGGTGGCT 1260 

ACTTATTTCT TTTAGGGGAT TGTCAGGAGG TGACCACTCT CACGGTGAAA TACCAAGTGT 1320 

CAGAGGAAGT GCCATCTGGT ACAGTGATCG GGAAGCTGTC CCAGGAACTG GGCCGGGAGG 1380 

AGAGGCGGAG GCAAGCTGGG GCTGCCTTCC AGGTGTTGCA GCTGCCTCAG GCGCTCCCCA 1440 

TTCAGGTGGA CTCTGAGGAA GGCTTGCTCA GCACAGGCAG GCGGCTGGAT CGAGAGCAGC 1500 

TGTGCCGACA GTGGGATCCC TGCCTGGTTT CCTTTGATGT GCTTGCCACA GGGGATTTGG 1560 

CTCTGATCCA TGTGGAGATC CAAGTGCTGG' ACATCAATGA CCACCAGCCA CGGTTTCCCA 1620 

AAGGCGAGCA GGAGCTGGAA ATCTCTGAGA GCGCCTCTCT GCGAACCCGG ATCCCCCTGG 1680 

ACAGAGCTCT TGACCCAGAC ACAGGCCCTA ACACCCTGCA CACCTACACT CTGTCTCCCA 1740 

GTGAGCACTT TGCCTTGGAT GTCATTGTGG GCCCTGATGA GACCAAACAT GCAGAACTCA 1800 

TAGTGGTGAA GGAGCTGGAC AGGGAAATCC ATTCATTTTT TGATCTGGTG TTAACTGCCT 1860 

ATGACAATGG GAACCCCCCC AAGTCAGGTA CCAGCTTGGT CAAGGTCAAC GTCTTGGACT 1920 

CCAATGACAA TAGCCCTGCG TTTGCTGAGA GTTCACTGGC ACTGGAAATC CAAGAAGATG 1980 

CTGCACCTGG TACGCTTCTC ATAAAACTGA CCGCCACAGA CCCTGACCAA GGCCCCAATG 2040 

GGGAGGTGGA GTTCTTCCTC AGTAAGCACA TGCCTCCAGA GGTGCTGGAC ACCTTCAGTA 2100 

TTGATGCCAA GACAGGCCAG GTCATTCTGC GTCGACCTCT AGACTATGAA AAGAACCCTG 2160 

CCTACGAGGT GGATGTTCAG GCAAGGGACC TGGGTCCCAA TCCTATCCCA GCCCATTGCA 2220 
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AAGTTCTCAT CAAGGTTCTG GATGTCAATG ACAACATCCC AAGCATCCAC GTCACATGGG 2280 

CCTCCCAGCC ATCACTGGTG TCAGAAGCTC TTCCCAAGGA CAGTTTTATT GCTCTTGTCA 2340 

TGGCAGATGA CTTGGATTCA GGACACAATG GTTTGGTCCA CTGCTGGCTG AGCCAAGAGC 2400 

TGGGCCACTT CAGGCTGAAA AGAACTAATG GCAACACATA CATGTTGCTA ACCAATGCCA 2460 

CACTGGACAG AGAGCAGTGG CCCAAATATA CCCTCACTCT GTTAGCCCAA GACCAAGGAC 2520 

TCCAGCCCTT ATCAGCCAAG AAACAGCTCA GCATTCAGAT CAGTGACATC AACGACAATG 2580 

CACCTGTGTT TGAGAAAAGC AGGTATGAAG TCTCCACGCG GGAAAACAAC TTACCCTCTC 2640 

TTCACCTCAT TACCATCAAG GCTCATGATG CAGACTTGGG CATTAATGGA AAAGTCTCAT 2700 

ACCGCATCCA GGACTCCCCA GTTGCTCACT TAGTAGCTAT TGACTCCAAC ACAGGAGAGG 2760 

TCACTGCTCA GAGGTCACTG AACTATGAAG AGATGGCCGG CTTTGAGTTC CAGGTGATCG 2820 

CAGAGGACAG CGGGCAACCC ATGCTTGCAT CCAGTGTCTC TGTGTGGGTC AGCCTCTTGG 28 80 

ATGCCAATGA TAATGCCCCA GAGGTGGTCC AGCCTGTGCT CAGCGATGGA AAAGCCAGCC 2940 

TCTCCGTGCT TGTGAATGCC TCCACAGGCC ACCTGCTGGT GCCCATCGAG ACTCCCAATG 3000 

GCTTGGGCCC AGCGGGCACT GACACACCTC CACTGGCCAC TCACAGCTCC CGGCCATTCC 3060 

TTTTGACAAC CATTGTGGCA AGAGATGCAG ACTCGGGGGC AAATGGAGAG CCCCTCTACA 312 0 

GCATCCGCAG TGGAAATGAA GCCCACCTCT TCATCCTCAA CCCTCATACG GGGCAGCTGT 3180 

TCGTCAATGT CACCAATGCC AGCAGCCTCA TTGGGAGTGA GTGGGAGCTG GAGATAGTAG 3240 

TAGAGGACCA GGGAAGCCCC CCCTTACAGA CCCGAGCCCT GTTGAGGGTC ATGTTTGTCA 3300 

CCAGTGTGGA CCACCTGAGG GACTCAGCCC GCAAGCCTGG GGCCTTGAGC ATGTCGATGC 3360 

TGACGGTGAT CTGCCTGGCT GTACTGTTGG GCATCTTCGG GTTGATCCTG GCTTTGTTCA 3420 

TGTCCATCTG CCGGACAGAA AAGAAGGACA ACAGGGCCTA CAACTGTCGG GAGGCCGAGT 3480 

CCACCTACCG CCAGCAGCCC AAGAGGCCCC AGAAACACAT TCAGAAGGCA GACATCCACC 3540 

TCGTGCCTGT GCTCAGGGGT CAGGCAGGTG AGCCTTGTGA AGTCGGGCAG TCCCACAAAG 3600 

ATGTGGACAA GGAGGCGATG ATGGAAGCAG GCTGGGACCC CTGCCTGCAG GCCCCCTTCC 3660 

ACCTCACCCC GACCCTGTAC AGGACGCTGC GTAATCAAGG CAACCAGGGA GCACCGGCGG 3720 

AGAGCCGAGA GGTGCTGCAA GACACGGTCA ACCTCCTTTT CAACCATCCC AGGCAGAGGA 3780 

ATGCCTCCCG GGAGAACCTG AACCTTCCCG AGCCCCAGCC TGCCACAGGC CAGCCACGTT 3840 

CCAGGCCTCT GAAGGTTGCA GGCAGCCCCA CAGGGAGGCT GGCTGGAGAC CAGGGCAGTG 3900 

AGGAAGCCCC ACAGAGGCCA CCAGCCTCCT CTGCAACCCT GAGACGGCAG CGACATCTCA 3960 

ATGGCAAAGT GTCCCCTGAG AAAGAATCAG GGCCCCGTCA GATCCTGCGG AGCCTGGTCC 4020 

GGCTGTCTGT GGCTGCCTTC GCCGAGCGGA ACCCCGTGGA GGAGCTCACT GTGGATTCTC 4080 

CTCCTGTTCA GCAAATCTCC CAGCTGCTGT CCTTGCTGCA TCAGGGCCAA TTCCAGCCCA 4140 

AACCAAACCA CCGAGGAAAT AAGTACTTGG CCAAGCCAGG AGGCAGCAGG AGTGCAATCC 4200 

CAGACACAGA TGGCCCAAGT GCAAGGGCTG GAGGCCAGAC AGACCCAGAA CAGGAGGAAG 4260 

GGCCTTTGGA TCCTGAAGAG GACCTCTCTG TGAAGCAACT GCTAGAAGAA GAGCTGTCAA 4320 

GTCTGCTGGA CCCCAGCACA GGTCTGGCCC TGGACCGGCT GAGCGCCCCT GACCCGGCCT 4380 

GGATGGCGAG ACTCTCTTTG CCCCTCACCA CCAACTACCG TGACAATGTG ATCTCCCCGG 4440 

ATGCTGCAGC CACGGAGGAG CCAAGGACCT TCCAGACGTT CGGCAAGGCA GAGGCACCAG 4500 

AGCTGAGCCC AACAGGCACG AGGCTGGCCA GCACCTTTGT CTCGGAGATG AGCTCACTGC 4560 

TGGAGATGCT GCTGGAACAG CGCTCCAGCA TGCCCGTGGA GGCCGCCTCC GAGGCGCTGC 4620 

GGCGGCTCTC GGTCTGCGGG AGGACCCTCA GTTTAGACTT GGCCACCAGT GCAGCCTCAG 4680 

GCATGAAAGT GCAAGGGGAC CCAGGTGGAA AGACGGGGAC TGAGGGCAAG AGCAGAGGCA 4 74 0 

GCAGCAGCAG CAGCAGGTGC CTG TGAA CAT ACCTCAGACG CCTCTGGATC CAAGAACCAG 4800 

GGGCCTGAGG ATCTGTGGAC AAGAGCTGGT TTCTAAAATC TTGTAACTCA CTAGCTAGCG 4860 

GCGGCCTGAG AACTTTAGGG TGACTGATGC TACCCCCACA GAGGAGGCAA GAGCCCCAGG 4920 

ACTAACAGCT GACTGACCAA AGCAGCCCCT TGTAAGCAGC TCTGAGTCTT TTGGAGGACA 4980 

GGGACGGTTT GTGGCTGAGA TAAGTGTTTC CTGGCAAAAC ATATGTGGAG CACAAAGGGT 5040 

CAGTCCTCTG GCAGAACAGA TGCCACGGAG TATCACAGGC AGGAAAGGGT GGCCTTCTTG 5100 

GGTAGCAGGA GTCAGGGGGC TGTACCCTGG GGGTGCCAGG AAATGCTCTC TGACCTATCA 5160 
ATAAAGGAAA AGCAGTGATT CAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID No: 175 Protein sequence:. 
Protein Accession #: NP_057664.1 



1 11 21 31 41 51 

I I I I I I 

MMQLLQLLLG LLGPGGYLFL LGDCQEVTTL TVKYQVSEEV PSGTVIGKLS QELGREERRR 60 

QAGAAFQVLQ LPQALPIQVD SEEGLLSTGR RLDREQLCRQ WDPCLVSFDV LATGDIALIH 120 

VEIQVLDIND HQPRFPKGEQ ELEISESASL RTRIPLDRAL DPDTGPNTLH TYTLSPSEHF 180 

ALDVIVGPDE TKHAELIWK ELDREIHSFF DLVLTAYDNG NPPKSGTSLV KVNVLDSNDN 240 

SPAFAESSLA LEIQEDAAPG TIjLIKLTATD PDQGPNGEVE FFLSKHMPPE VLDTFSIDAK 300 

TGQVILRRPL DYEKNPAYEV DVQARDLGPN PIPAHCKVLI KVLDVNDNIP SIHVTWASQP 360 

SLVSEALPKD SFIALVMADD LDSGHNGLVH CWLSQELGHF RLKRTNGNTY MLLTNATLDR 420 

EQWPKYTLTL LAQDQGLQPL SAKKQIiSIQI SDINDNAPVF EKSRYEVSTR ENNLPSLHLI 480 

TIKAHDADLG INGKVSYRIQ DSPVAHLVAI DSNTGEVTAQ RSLNYEEMAG FEFQVIAEDS 540 

GQPMLASSVS VWVSLLDAND NAPEWQPVL SDGKASLSVL VNASTGHLLV PIETPNGLGP 600 

AGTDTPPLAT HSSRPFLLTT IVARDADSGA NGEPLYSIRS GNEAHLFILN PHTGQLFVNV 660 

TNASSLIGSE WELEIWEDQ GSPPIjQTRAL LRVMFVTSVD HLRDSARKPG ALSMSMLTVI 720 

CLAVLLGIFG LILALFMSIC RTEKKDNRAY NCREAESTYR QQPKRPQKHI QKADIHLVPV 780 

LRGQAGEPCE VGQSHKDVDK EAMMEAGWDP CLQAPFHLTP TLYRTLRNQG NQGAPAESRE 840 

VLQDTVNLLF NHPRQRNASR ENLNLPEPQP ATGQPRSRPL KVAGSPTGRL AGDQGSEEAP 900 

QRPPASSATL RRQRHLNGKV SPEKESGPRQ ILRSLVRLSV AAFAERNPVE ELTVDSPPVQ 960 

QISQLLSLIiH QGQFQPKPNH RGNKYLAKPG GSRSAIPDTD GPSARAGGQT DPEQEEGPLD 1020 

PEEDLSVKQL LEEELSSLLD PSTGLALDRL SAPDPAWMAR LSLPLTTNYR DNVISPDAAA 1080 

TEEPRTFQTF GKAEAPELSP TGTRLASTFV SEMSSLLEML LEQRSSMPVE AASEALRRLS 1140 
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VCGRTLSLDL ATSAASGMKV QGDPGGKTGT EGKSRGSSSS SRCL 

Seq ID NO: 176 DNA sequence 
Nucleic Acid Accession #: AL109712.1 

Coding sequence: 2-128 (underlined sequences correspond to start and stop codons) 

1, 11 21 31 41 51 

I I I I I I 

GAGTCTCTTT GGGCCAGCCG GGCTGCTGCA GACAGACAGG AAGCACGCCT GACGCTCCTC 60 

TACCCTCGGG CAGCACAGCG GGGCTGGGAC TCACTCTAGC TTGCCCAGCA ACTTGCTTTC 120 

CTGTGTGAAC TCTGGCAGGC TGCCCTCTCT GTGCAAAGCT GCCACTGGGG CCTGCTCAGG 180 

GTGGCCTGGA ACTTGGAGGT GGGCAGTCAG GGCCTAGGAT GGGCCTGTGT CACCAGGGCA 240 

TGTGCCCTTG GGCCAGTTAC TTCCTCTCAG AGCCTTGGGC TCCTCCTCTG AGGATGGGGC 300 

TTGTTGGTGT GAAATGAGGT GAGCATGTTG AGTTGGGGAG CAGCAGGACA CGCACCTGCA 360 

GGCAGCCGCC CTGGCCACGC TCCCTCCCTA CCTTCCGAGT CCTGGGACAG ACACAGTAGA 420 

GCACAGCGGG CCAGCCTGCT CTCTTCTCTG TCTACTTTTT GCAGAAGAGT CAACAGATAC 480 

AACAGGCCCA GGGAGGTGCC CCTGGGGGCC CCAGTCCCCA TCACTCCAAG GGGCAGTCCT 540 

GCAAGTGACA AGGTGGGCCC AATCCCTGTG GAACAGGTCT CTGAGGACCA CAGAGTGGGG 600 

CCCCAGGGAA AGCTGGGAGC CGAGCTAGAG GCAGGCAGCA AGTAAGGGCA AAGCTGTGCC 660 

CCTGCCCGGA AGACCTTCCT GCCCCCAGAA CCCGACCCTC CGCAGATAGC CCTCCCTGGG 720 

CAGCAGCCCC CCAGCTTCCA AGGCCCGTGC CTCACCAGAC GCCATGCTCT CACGGACTTG 780 

TTTGCTGCTC TGTACCCTGC AGATCTGCCC CAGAGGAGCA GGTGAAAAGC CGCGCCTGCC 840 

GAGGTGCTGT GGCGGTGGAG TTTTGGGCAG AGGAGTGGGG GGAAGAGTTT CTCACTTTTA 900 

AGATTCTCCA AATCCAAGAT GAAGTCATGC TGTGCTTTGG AATGGTAGAT GCTCATTTAT 960 
GTAAAATCAT AATAAATGTT ACACAAACTG TTAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID No: 177 Protein sequence: 
Protein Accession #: AL109712.1 

1 11 21 31 41 51 

I I I I I I 

VSLGQPGCCR QTGSTPDAPL PSGSTAGLGL TLACPATCFP V 

Seq ID NO: 178 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 3-107 {underlined sequences correspond to staxt and stop codons) 

1 11 21 31 41 51 

I I 1 I I I 

AATGGAGCAC TCCAAAGAAC GATTTGACCA ATAGCATTTC TTCTCTGGGG GTTGTATTTC 60 

AAAGCATGCA ACTCTCCAGG GAACCAGAAC TAAATTGCTT AAAATGAAGT CATTCCTCAG 120 

ATTAACTTCC TCAGATAAAG TGTCAGCGGT CTGCAGAAAC GAAGAAGACA AAACTGAGAT 180 

TATCACTCAT AATTCTCTTA CTTACTATGT CAGTGAAACA ATGAGTTTGC ATTTTTGCAA 240 

TCCTAGAACA TTCTTCATTA GCCCTGGGTC ATGACCTCTT CCAGTTAATT CTCTTTCACA 300 

CCTTTAGGAA AGATTTAAGA TGAACCTTCA ATAGGATATT AACATAACTC ATAGCCAATA 360 

CCACAGCTGC CTTTCAAATT AATGAGGTTA ATTGTTCTCC AGCAAACATG AGTTTGTCTT 420 

TGGCATTTTA AATGCTTCCC ATTGATCTGA CATTTTGCTG TTTCAAGTTT TAAAGGGCTC 480 

AAATCAAAGA CTATTGATAA CTGAGCAAAG AGCGAAGATC CAGAAATACG AAAACATTGT 540 

CTTTTTTTTT CCATGAAAAA CAATCATAGC CTTTTGAATT CAATCGAAGT TTCTACATTA 600 

GCCATCTAAG ACTTATTTAA TTATTTCTGT TCTCAGTCAA GCTAATTCAA GTGAATGAAC 660 

AGTATTGACT TTTAAAATCT TTTTTAAATT TTTTTAAATC TTTAGTTTAT TAAGTTTGTA 720 

gaaaagctct ggggccatga ccacttacgt aaatgtttca gtttaaaaac aaaagattca 780 

ggcctctaat ttgagccaaa tccaggtgat cttgtttgaa atttttgatg aatttgaaaa 840 

gatgaaagtg gaacttttaa cattcatgtt ccccaaattt ttcactggga agggatgcta 900 

attgcctact taagatataa gttcaagaat aacattttca tagaaaattc agaaaactgc 960 

ttgacacagc agtgacatag ttagatgtgg ctcagatgcc ttccaaacct gagggtcccc 1020 

aaagatttct ttaccagttg tttttaacta tgaatcttaa tcttgttcat tcccctgcca 1080 
. aaacaaattt aaaag 

Seq ID No: 179 Protein sequence: 
Protein Accession ft: none found 

1 11 21 31 41 51 

I I ill I 

WSTPKNDLTN SISSLGWFQ SMQLSREPEL NCLK 

Seq ID NO: 180 DNA sequence 

Nucleic Acid Accession ft: none found 

Coding sequence: 2-176 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 
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COGGGTGGGG CCTCGGGATG CAGGCGCCGG TGCCCGGGCC CCTGGGCCTG CTGGACCCCG 60 

CAGAAGGGCT TTCGAGGAGG AAGAAGACGT CGCTCTGGTT TGTGGGGTCT CTGCTGCTGG 120 

TGTCCGTCCT CATAGTCACC GTCGGGCTGG CTGCATCAGC AGGACGGAGA ATGTGACCGT 180 

TGGGGGCTAC TACCCAGGGA TCATTCTCGG CTTTGGATCT TTCTTAGGAA TTATTGGCAT 240 

5 CAACTTGGTG GAGAATAGAA GGCAAATGCT GGTGGCAGCG ATCGTGTTTA TCAGTTTTGG 300 

CGTGGTGGCC GCCTTCTGCT GCGCCATCGT GGACGGCGTA TTTGCAGCAC AGCACATTGA 360 

ACCGAGGCCC CTCACCACGG GAAGATGCCA GTTTTACTCC AGTGGGGTGG GGTACTTGTA 420 

CGATGTCTAC CAGACAGAGG TGAGCAGGAG CACTGAGATT CATGTGGGTT TTGCTCAGCT 480 

AACCCCGCCG ACCCCACGCG GTTTTCCCTG CACATAGGCG TGGTCTGAAT ATTTGGATTC 540 

10 TAATAGTTCC TGGGGGTCAC CCCTGCAGCT GGTGAACCGT TGATGCCCCC TGTGTAAGGG 600 

ACCTTGACAT TTCGATGTGC TGTATTTCAC TCTGGAGTCA GAGTTCTGGA CTTGCTTCAT 660 

TAAATCACAA CAGTCTCAGA AAACAACCGC ACCACCCCGC AATCCCACCA AAGGGGCGCG 720 
CCGTCCCTAA GAGTTATCCC 

15 Seq ID No: 181 Protein sequence: 
Protein Accession #: none found 



1 11 21 31 41 51 

on I I I 1 1 1 

20 RVGPRDAGAG ARAPGPAGPR RRAPEEEEDV ALVCGVSAAG VRPHSHRRAG CISRTENV 

Seq ID NO: 182 DNA sequence 

Nucleic Acid Accession #: AK001579.1 

Coding sequence: 1150-2637 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I i I I I 

TTTTCTCTGC TTTTCGCTAC CCCGGTCACT CTCATTTCTC TCCCCTATTC CTTGTCTCTT 60 

30 CCCCCATCCC CCTTTCTCCT GTCCTCCCCC TGCCTCTACA GTGGTTCTCC CCGCTGAGCT 120 

GCCACCAGCT GCTGGGCCCC GGGCTGCTGC GGCTGGGCCG CCTATGGCTG CGGTCCCCCT 180 

CCCATACAGC CCCGGCCCCT GGTCTCTGGC TGTCAGGGTT TGGCCTCCTT CGTGGTGACC 240 

ACCTCTTCCT GTGCTCAGCG CCGGGCCCAG GCCCCCCAGC CCCTGAGGAC ATGGTGCATC 300 

TGCGGCGGCT ACAGGAGATC AGTGTGGTTT CTGCAGCTGA CACCCCAGAT AAGAAAGAGC 360 

35 ATTTGGTCCT GGTGGAGACA GGAAGGACCC TGTATCTGCA AGGAGAGGGC CGGCTGGACT 420 

TCACGGCATG GAACGCAGCC ATTGGGGGCG CGGCTGGTGG GGGCGGCACA GGGCTGCAGG 480 

AGCAGCAGAT GAGCCGGGGT GACATCCCCA TCATCGTGGA TGCCTGCATC AGTTTTGTTA 540 

CCCAGCATGG GCTCCGGCTG GAAGGTGTAT ACCGGAAAGG GGGCGCTCGT GCCCGCAGCC 600 

TGAGACTCCT GGCTGAGTTC CGTCGGGATG CCCGGTCGGT GAAGCTCCGA CCAGGGGAGC 660 

40 ACTTTGTGGA GGATGTCACT GACACACTCA AACGCTTCTT TCGTGAGCTC GATGACCCTG 720 

TGACCTCTGC ACGGTTGCTG CCTCGCTGGA GGGAGGCTGC TGGTATTCCT AAGATCCCTG 780 

AGAGCCAAGG CCCAACCAGG ATCTCTGCCT TCCCCCACCA GAATCCATGG TTTGGCAGCC 840 

CTCCGCCCCA TCACTTCCCA CCCTGGGGGA TCATCCAGAG ACTTGGCTCA GGGGGAGGTG 900 

GGAAGGGGGC AGAGACACAT CCATCCTGCA TTTGTGCCTA AAAATCCCTC CCTCTGTACC 960 

45 AGCTGCCACT CTTTCTTCCC GGGTCCTCCC CAACCCTCCT CCATTCCATC CCCAGAGCTG 1020 

CCCCAGAAGA ATCAGCGCCT GGAGAAATAT AAAGATGTGA TTGGCTGCCT GCCGCGGGTC 1080 

AACCGCCGCA CACTGGCCAC CCTCATTGGG CATCTCTATC GGGTGCAGAA ATGTGCGGCT ; 1140 

CTAAACCA GA TGT GCACGCG GAACTTGGCT CTGCTGTTTG CACCCAGCGT GTTCCAGACG 1200 

GATGGGCGAG GGGAGCACGA GGTGCGAGTG CTGCAAGAGC TCATTGATGG CTACATCTCT 1260 

50 GTCTTTGATA TCGATTCTGA- CCAGGTAGCT CAGATTGACT TGGAGGTCAG TCTTATCACC 1320 

ACCTGGAAGG ACGTGCAGCT GTCTCAGGCT GGAGACCTCA TCATGGAAGT TTATATAGAG 1380 

CAGCAGCTCC CAGACAACTG TGTCACCCTG AAGGTGTCCC CAACCCTGAC TGCTGAGGAG 1440 

CTGACTAACC AGGTACTGGA GATGCGGGGG ACAGCAGCTG GGATGGACTT GTGGGTGACT 1500 

TTTGAGATTC GCGAGCATGG GGAGCTGGAG CGGCCACTGC ATCCCAAGGA AAAGGTCTTA 1560 

55 GAGCAGGCTT TACAATGGTG CCAGCTCCCA GAGCCCTGCT CAGCTTCCCT GCTCTTGAAA 1620 

AAAGTCCCCC TGGCCCAAGC TGGCTGCCTC TTCACAGGTA TCCGACGTGA GAGCCCACGG 1680 

GTGGGGCTGT TGCGGTGTCG TGAGGAGCCA CCTCGCTTGC TGGGAAGCCG CTTCCAGGAG 1740 

AGGTTCTTTC TGCTGCGTGG CCGCTGCCTG CTGCTGCTCA AGGAGAAGAA AAGCTCTAAA 1800 

CCAGAACGGG AGTGGCCTTT GGAAGGTGCC AAGGTCTACC TGGGAATCCG CAAGAAGTTA 1860 

60 AAGCCCCCAA CACCGTGGGG CTTCACATTG ATACTAGAGA AGATGCACCT CTACTTGTCC 1920 

TGCACTGACG AGGATGAAAT GTGGGATTGG ACCACCAGCA TCCTTAAAGC CCAGCACGAT 1980 

GACCAGCAGC CAGTGGTCTT ACGACGCCAT TCCTCCTCTG ACCTTGCCCG TCAGAAGTTT 2040 

GGCACTATGC CTTTGCTGCC TATCCGTGGG GATGACAGTG GAGCCACCCT CCTCTCTGCC 2100 

AATCAGACCC TGCGGCGACT ACACAACCGG AGGACCCTGT CCATGTTCTT TCCAATGAAG 2160 

65 TCATCCCAGG GGTCTGTGGA GGAGCAAGAG GAGCTGGAGG AGCCTGTGTA CGAGGAGCCA 2220 

GTGTATGAGG AAGTAGGGGC CTTCCCTGAG TTGATCCAGG ACACTTCTAC CTCCTTCTCC 2280 

ACCACACGGG AGTGGACAGT GAAGCCAGAG AACCCCCTCA CCAGCCAGAA GTCATTGGAT 2340 

CAACCCTTTC TCTCCAAGTC AAGCACCCTT GGCCAGGAGG AGAGGCCACC TGAGCCCCCT 2400 

CCAGGCCCCC CTTCAAAGAG CAGTCCCCAG GCACGGGGGT CCCTAGAGGA ACAGCTGCTC 2460 

70 CAGGAGCTCA GCAGCCTCAT CCTGAGGAAA GGAGAGACCA CTGCAGGCCT GGGAAGTCCT 2520 

TCCCAGCCAT CCAGCCCCCA ATCCCCCAGC CCCACTGGCC TTCCAACACA GACACCTGGC 2580 

TTCCCCACCC AACCCCCATG CACTTCCAGT CCACCCTCCA GCCAGCCCCT CACATGACCC 2640 

TAGGACCAGC AGTCTGAGAG GGTAGGTACC AGAAGACCCA GAAACTCTTA TCGTGGCACT 2700 

GTTGCAGCTT CCTCTGCCCT GGCTGGAAAG ACTCCAGAAT CCAGTGTGGT GCTGTGGAAG 2760 

75 GAGCACTGGA CTAAAGGCTT CAGTGGCTGC GTGTCCCAGG ACAGGTCATG GCCCCTCTCT 2820 

GGGCCCAGCC CATTTATCTA TACCATGAGG TAACTGAAGT AAGGAGAGCA GTGAATGTCA 2880 
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AACTGTGTTT CTTAGAGCCA TAAGCCCCAC ATATTATCCC TGAACAAGGG CAGCTCCTGC 2940 

TTTATATATT TGATACGTAG GGGTTCCATG AGAGATTTTG GGTTTTAAAG GAATGGTTTT 3000 

ACTGCATTAA AGAAAAAAAA TGCTTTGGAA ACCAGAGGCC TGGGTGATGT TAAAGTCTAT 3060 

CCTGTCCCAC TTCCTACATT CTGGGACTAC CGTGAAGCCT GGAGTAGGGA GAGCGAGTTT 3120 
GGGAGCTGGG ACTCGGGGAG TCAAAAATAG ATGAGTAATT GTCAATAAAC CTGGGAACC 



Seq ID No: 183 Protein sequence; 
Protein Accession #: AK001579.1 



10 



15 



20 



25 



MSLTHSNASF 
SHPGGSSRDL 
RLEKYKDVIG 
HEVRVLQELI 
NCVTLKVSPT 
WCQLPEPCSA 
RGRCLLLLKE 
EMWDWTTSIL 
RLHNRRTLSM 
TVKPENPLTS 
IiILRKGETTA 



11 
I 

VSSMTLPIjHG 
AQGEVGRGQR 
CLPRVNRRTL 
DGYISVFDID 
LTAEELTNQV 
SLLLKKVPLA 
KKSSKPEREW 
KAQHDDQQPV 
FFPMKSSQGS 
QKSLDQPFLS 
GLGSPSQPSS 



21 
I 

CCLAGGRLLV 
HIHPAFVPKN 
ATLIGHLYRV 
SDQVAQIDLE 
LEMRGTAAGM 
QAGCLFTGIR 
PLEGAKVYLG 
VLRRHSSSDL 
VEEQEELEEP 
KSSTLGQEER 
PQSPSPTGLP 



31 
I 

FLRSLRAKAQ 
PSLCTSCHSF 
QKCAALNQMC 
VSLITTWKDV 
DLWVTFBIRE 
RESPRVGLLR 
IRKKLKPPTP 
ARQKFGTMPL 
VYEEPVYEEV 
PPEPPPGPPS 
TQTPGFPTQP 



41 
I 

PGSLPSPTRI 
FPGPPQPSSI 
TRNLALLFAP 
QLSQAGDLIM 
HGELERPLHP 
CREEPPRIiLG 
WGFTLILEKM 
LPIRGDDSGA 
GAFPELIQDT 
KSSPQARGSL 
PCTSSPPSSQ 



51 

I 

HGLAALRPIT 
PSPELPQKNQ 
SVFQTDGRGE 
EVYIEQQLPD 
KEKVLEQALQ 
SRFQERFFIiL 
HLYLSCTDED 
TLLSANQTLR 
STSFSTTREW 
EEQLLQELSS 
PLT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 184 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-81 (underlined sequences correspond to start and stop codons) 



30 



35 



40 



45 



50 



55 



60 



GTAG AGTTAG 
GTATTAAGCT 
AATCTCAACA 
CTTTCTCTAT 
AACCCTAACT 
CACTGCACCA 
ACTGAAGATG 
AACATGTAAT 
TGTTCAGCTG 
AAGTTTTGTT 
ACAAAGGCAT 
GGATGGCAGT 
TGTGCCTCTT 
CCATTAGCTA 
GTCATGTGCT 
TTGCTTGGTT 
ACTTGGATGC 
TTCTTTTGTC 
CAGGATGGTT 
AGTAAGCTTT 
TGGCTTTTCA 
GCTTATTCTT 
CACCTCCTTG 
GCACTTAAAA 
TGCAGGTTAG 
CACCGTTCTA 
GACACAGTCA 
TTTTGCAGAA 



11 
I 

TGTCAATGTG 
TAAAAAGTTA 
TAAGAAGTCA 
AATTTCATCA 
CTGCTAATTA 
GCTTTGTTAT 
AGAGAACATG 
GAATGTAGTA 
TAACAGAATA 
ACGAATTCAG 
CTTTCCTGAT 
TCCAGCCCTG 
CACTTTAATC 
ATGCTTGCTT 
CAATTAATAT 
GCATTCTTCT 
CTCAGTTGTC 
ATCAGCACCA 
GCAACAACCA 
TGAAAATGTA 
TATTACTCAA 
CCTCTTACTT 
CTGCTTGTCC 
TAGAAAAAAA 
AAGTCATGGA 
TTCAACCCCA 
GCTCTTTCTG 
GAAAAC 



21 
I 

CTTAGAATAT 
_ATTCAGTTTA 
AAATGTAATG 
GTATGTCCTC 
TAAGCTAGGC 
CTGTAAAATG 
ATATGTGTAA 
ATAGTAATTA 
CCCAAAATAA 
ACAATCCAGG 
TTCTGCCAGT 
GTTACGCCCA 
ATAGCTCCCA 
ACATGGTCAC 
CCAAGTGTCC 
TAGCATAAGC 
CTTTCATTTA 
TCACTACCAC 
CCATAGGGAC 
GGTCAGATCA 
AAGAAAACCT 
TATCTCTGTA 
TATACTCCTA 
AAAAAAAAAA 
GCTGGGATCT 
TTGCCTAGAG 
AGAAAAGGCA 



31 
I 

ACCAAATTCA 
AGGAATATAA 
CTGCCAGATA 
TCCCTTTTCT 
AAGTAATCTT 
ATGATAATAC 
AGTGCCTTCC 
TTTTATTTTC 
CAGTTTTAAA 
GCTTTTATAG 
CTCAATGCAT 
TATTAGCACA 
CTAGATGCAC 
ACTTAGTTTC 
AATTACTGAG 
CACATTCTTT 
GAAATGCTCC 
TGCCTTCTTC 
TTTTTGCTTC 
TGTCTCTCTC 
AAAACTTTGC 
TTGCTCTTCC 
AAAGAAGTTC 
AGCTCAGAGA 
AAATCCATGT 
GTGCTTGATT 
GCTCAGCATT 



41 
I 

TAAACATTTT 
ACCAAATTAT 
ACAATATCAA 
CCTATTTGTC 
GGACAAGTTA 
CAACACCTTC 
ACAATACCCA 
TTTTGATTCA 
CAAATTAAAG 
ATGCACCAGG 
GGGTTGCAAT 
CAGAAAGAAA 
CCACTACTTC 
CAGAGAGACA 
AAAAAAAGAA 
TTATGAAGTT 
TTGGACATCC 
AAAGCCACCA 
TACTTCCACA 
TTCTCTTCAA 
TGTGAGATCT 
TCACTCTACT 
AGTCTTCCCT 
GGCTGAGTTG 
CAGTCTGACT 
GCTCAATAAT 
TCCATGAGAT 



51 

I 

CTCTAAAAAA 
TTTATATTTG 

AGGTATTTTT 
AAATTTTAGC 
TTTGACCTCT 
TTCTTGGGGT 
GAACATAGCA 
GTTGGGACTA 
TTTTGTTGTG 
ATCAGCAGGT 
CCAGAGTCCA 
GAGAAAGGGA 
TGCTGATACT 
TGTCTGGACA 
ACTAGCACCT 
GTCCTCAGTT 
TGAATCTGAC 
CGTTCTGTCC 
CAATAGCCAG 
AACCCTCCGA 
ATGTGACCCG 
CCAGCCATCC 
TATGATATTT 
TCCAAGGTCA 
ATGAGTTCTG 
AGATTCCATG 
CCGCACATCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



Seq ID No: 185 Protein sequence: 
Protein Accession #: none found 



65 



70 



75 



ii 



21 
I 



31 



VELVSMCLEY TKFINIFSKK VLSLKS 



41 
I 



51 



Seq ID NO: 186 DNA sequence 

Nucleic Acid Accession #: NMJ>02203.2 

Coding sequence: 43-3588 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

| I I I I I 

CTGCAAACCC AGCGCAACTA CGGTCCCCCG GTCAGACCCA GGATGGGGCC AGAACGGACA 60 
GGGGCCGCGC CGCTGCCGCT GCTGCTGGTG TTAGCGCTCA GTCAAGGCAT TTTAAATTGT 120 
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TGTTTGGCCT ACAATGTTGG TCTCCCAGAA 
CAGTTTGGGT ATGCAGTGCA GCAGTTTATA 
TCACCCTGGA GTGGCTTTCC TGAGAACCGA 
CTATCCACTG CCACATGTGA AAAACTAAAT 
5 ACTGAGATGA AAACCAACAT GAGCCTCGGC 
GGTTTTCTCA CATGTGGTCC TCTGTGGGCA 
GGTGTGTGTT CTGACATCAG TCCTGATTTT 
CAGCCCTGCC CTTCCCTCAT AGATGTTGTG 
CCTTGGGATG CAGTAAAGAA TTTTTTGGAA 

10 ACAAAGACAC AGGTGGGGTT AATTCAGTAT 
AACACATATA AAACCAAAGA AGAAATGATT 
GGGGACCTCA CAAACACATT CGGAGCAATT 
GCTTCTGGTG GGCGACGAAG TGCTACGAAA 
CATGATGGTT CAATGTTGAA AGCTGTGATT 

15 TTTGGCATAG CAGTTCTTGG GTACTTAAAC 
AAAGAAATAA AAGCGATCGC TAGTATTCCA 
GAAGCAGCTC TACTAGAAAA GGCTGGGACA 
ACTGTTCAAG GAGGAGACAA CTTTCAGATG 
TACTCTTCTC AAAATGATAT TCTGATGCTG 

20 ACCATTGTCC AGAAGACATC TCATGGCCAT 
ATTCTGCAGG ACAGAAATCA CAGTTCATAT 
GGAGAAAGCA CTCACTTTGT TGCTGGTGCT 
CTATATAGTG TGAATGAGAA TGGCAATATC 
ATTGGCTCCT ATTTTGGTAG TGTGCTGTGT 

25 GACGTGCTCT TGGTAGGTGC ACCAATGTAC 
GTCTACCTGT TTACTATCAA AAAGGGCATT 
GAGGGCATTG AAAACACTCG ATTTGGTTCA 
GATGGCTTTA ATGATGTGAT TGTTGGTTCA 
TACATTTACA ATGGTCATCA GGGCACTATC 

30 TCCGATGGAG CCTTTAGGAG CCATCTCCAG 
GATTTAAATG GGGATTCCAT CACCGATGTG 
CTCTGGTCAC AAAGTATTGC TGATGTAGCT 
ACTTTGGTCA ACAAGAATGC TCAGATAATT 
CCTACTAAGC AAAACAATCA AGTGGCCATT 

35 TTTTCATCCA GAGTAACCTC CAGGGGGTTA 
AAGAATATGG TAGTAAATCA AGCACAGAGT 
CCCTCTGATG TTGTCAACTC TTTGGATTTG 
ACTAGCCCTG CCCTTGAAGC CTATTCTGAG 
AAAGACTGTG GTGAGGATGG ACTTTGCATT 

40 CCAGCTGCTC AAGAACAACC CTTTATTGTC 
GTAACACTGA AAAATAAAAG GGAAAGTGCA 
GAAAACTTGT TTTTTGCATC ATTCTCCCTA 
GTGGCTGCAT CTCAGAAGTC TGTTGCCTGC 
CAACAGGTGA CTTTTACTAT TAACTTTGAC 

45 TCTCTCAGTT TCCAAGCCTT AAGTGAAAGC 
AACCTCAAAA TTCCTCTCCT GTATGATGCT 
AATTTTTATG AAATCTCTTC GGATGGGAAT 
GTTGGTCCAA AATTCATCTT CTCCCTGAAG 
GCAACTGTAA TCATCCACAT CCCTCAGTAT 

50 ACTGGGGTGC AAACAGACAA GGCTGGTGAC 
AAAATAGGAC AAACATCTTC TTCTGTATCT 
GAATTGAACT GCAGAACTGC TTCCTGTAGT 
ATGAAAGGAG AATACTTTGT TAATGTGACT 
TCAACGTTCC AGACAGTACA GCTAACGGCA 

55 ATATATGTGA TTGAAGATAA CACTGTTACG 
AAAGCCGAAG TACCAACAGG AGTTATAATA 
TTAGCTCTGG TTGCAATTTT ATGGAAGCTC 
ACCAAAAATC CAGATGAGAT TGATGAGACC 
ACCTGCAGTG GGAACCGGCA GCATCCCAGC 

60 TTTTTAAATC CCATATTTTT TTTATCATGT 
AAAACTGCAG GTCAGTTTGG ATGAAGAAAT 
GTAGGGAAAT AATAGGGAAA ATACCTATTT 
AACTGGCTGG CCCAGAGTTT ACATTCTAAT 
CAAGCATGAC AACTTTTAAA GAAAAATATG 

65 TCTCTTTAAA ATATTTGTCT TTAAACAGCA 
GTACTTCCAC TTGTGTATAT TTTAATGAAT 
ACAGGTTTTT TCAATTTATG CTGCTCATCC 
TAATTTTATT TATAAACTAG GTAAAATTTG 
CTTCCACACC CCATCTTGCT CTAATGATCA 

70 TACCTCCTAT ATGTCCATTT AAGTTAGGAG 
TTTTGTTTAA AACTCAGAAT ATAACATTTA 
. TGTGCCAGAG GAAGGAAAAG GAGGAAATTT 
CTTCTAGGAT TTGTTTGGCT GACTGGCAGT 
TTCTTTGGCA ACCTTCCTCC TCCCTTACTG 

75 TATTATAGAA GCCCTCTACA GCCTGACTTT 
TTACCCCTCA TCCAAAGTTC CCACTCCTTC 



GCAAAAATAT TTTCCGGTCC TTCAAGTGAA 180 

AATCCAAAAG GCAACTGGTT ACTGGTTGGT 240 

ATGGGAGATG TGTATAAATG TCCTGTTGAC 300 

TTGCAAACTT CAACAAGCAT TCCAAATGTT 360 

TTGATCCTCA CCAGGAACAT GGGAACTGGA 420 

CAGCAATGTG GGAATCAGTA TTACACAACG 480 

CAGCTCTCAG CCAGCTTCTC ACCTGCAACT 540 

GTTGTGTGTG ATGAATCAAA TAGTATTTAT 600 

AAATTTGTAC AAGGCCTTGA TATAGGCCCC 660 

GCCAATAATC CAAGAGTTGT GTTTAACTTG 720 

GTAGCAACAT CCCAGACATC CCAATATGGT 780 

CAATATGCAA GAAAATATGC CTATTCAGCA 840 

GTAATGGTAG TTGTAACTGA CGGTGAATCA 900 

GATCAATGCA ACCATGACAA TATACTGAGG 960 

AGAAACGCCC TTGATACTAA AAATTTAATA 1020 

ACAGAAAGAT ACTTTTTCAA TGTGTCTGAT 1080 

TTAGGAGAAC AAATTTTCAG CATTGAAGGT 1140 

GAAATGTCAC AAGTGGGATT CAGTGCAGAT 1200 

GGTGCAGTGG GAGCTTTTGG CTGGAGTGGG 1260 

TTGATCTTTC CTAAACAAGC CTTTGACCAA 1320 

TTAGGTTACT CTGTGGCTGC AATTTCTACT 1380 

CCTCGGGCAA ATTATACCGG CCAGATAGTG 1440 

ACGGTTATTC AGGCTCACCG AGGTGACCAG 1500 

TCAGTTGATG TGGATAAAGA CACCATTACA 1560 

ATGAGTGACC TAAAGAAAGA GGAAGGAAGA 1620 

TTGGGTCAGC ACCAATTTCT TGAAGGCCCC 1680 

GCAATTGCAG CTCTTTCAGA CATCAACATG 1740 

CCACTAGAAA ATCAGAATTC TGGAGCTGTA 1800 

CGCACAAAGT ATTCCCAGAA AATCTTGGGA 1860 

TACTTTGGGA GGTCCTTGGA TGGCTATGGA 1920 

TCTATTGGTG CCTTTGGACA AGTGGTTCAA 1980 

ATAGAAGCTT CATTCACACC AGAAAAAATC 2040 

CTCAAACTCT GCTTCAGTGC AAAGTTCAGA 2100 

GTATATAACA TCACACTTGA TGCAGATGGA 2160 

TTTAAAGAAA ACAATGAAAG GTGCCTGCAG 2220 

TGCCCCGAGC ACATCATTTA TATACAGGAG 2280 

CGTGTGGACA TCAGTCTGGA AAACCCTGGC 2340 

ACTGCCAAGG TCTTCAGTAT TCCTTTCCAC 2400 

TCTGATCTAG TCCTAGATGT CCGACAAATA 2460 

AGCAACCAAA ACAAAAGGTT AACATTTTCA 2520 

TACAACACTG GAATTGTTGT TGATTTTTCA 2580 

CCGGTTGATG GGACAGAAGT AACATGCCAG 2640 

GATGTAGGCT ACCCTGCTTT AAAGAGAGAA 2700 

TTCAATCTTC AAAACCTTCA GAATCAGGCG 2760 

CAAGAAGAAA ACAAGGCTGA TAATTTGGTC 2820 

GAAATTCACT TAACAAGATC TACCAACATA 2880 

GTTCCTTCAA TCGTGCACAG TTTTGAAGAT 2940 

GTAACAACAG GAAGTGTTCC AGTAAGCATG 3000 

ACCAAAGAAA AGAACCCACT GATGTACCTA 3060 

ATCAGTTGTA ATGCAGATAT CAATCCACTG 3120 

TTCAAAAGTG AAAATTTCAG GCACACCAAA 3180 

AATGTTACCT GCTGGTTGAA AGACGTTCAC 3240 

ACCAGAATTT GGAACGGGAC TTTCGCATCA 3300 

GCTGCAGAAA TCAACACCTA TAACCCTGAG 3360 

ATTCCCCTGA TGATAATGAA ACCTGATGAG 3420 

GGAAGTATAA TTGCTGGAAT CCTTTTGCTG 3480 

GGCTTCTTCA AAAGAAAATA TGAAAAGATG 3540 

ACAGAGCTCA GTAGCTGAAC CAGCAGACCT 3600 

CAGGGTTTGC TGTTTGCGTG CATGGATTTC 3660 

CGTAGGTAAA CTAACCTGGT ATTTTAAGAG 3720 

TGTGGGGGGT GGGGGAGGTG CGGGGGGCAG 3780 

TATATGATGG GGGAAAAAAA GTAATCTTTA 3840 

TTGCATTGTG TCAGAAACAT GAAATGCTTC 3900 

ATACTCTCAG ATTTTAAGGG GGAAAACTGT 3960 

ACTACAGAAG TGGAAGTGCT TGATATGTAA 4020 

ATTGATGTTA ACAAGAGGGG AAAACAAAAC 4080 

AAAGTTGCCA CAGATGATAC TTCCAAGTGA 4140 

TTGTTGGTTC CTTTTATACC ACGGCTGCCC 4200 

AAACATGCTT GAATAACTGA GCTTAGAGTA 4260 

AGGGGGCGAT ATAGAGACTA AGGCACAAAA 4320 

TGTAAAATCC CATCTGCTAG AAGCCCATCC 4380 

CCTTTCTCTT TTAGGAGGCA CAACAGTTCT 4440 

AACCTAGTGA ATTTTTGAAA GATGAGTAAT 4500 

AACCACTCTC CCACCTCCTG GTGGTACCAT 4560 

CTCTCCAGCG GTCCAAAGTT ATCCCCTCCT 4620 

AGGACAGCTG CTGTGCATTA GATATTAGGG 4680 
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10 



GGGAAAGTCA 
TTCAGGGAGC 
CTAAATGTTG 
CCAGAAGTTA 
GCTGTCTTGT 
GTTCAAAAGG 
TTTTAACAAG 
ATTTCTACTT 
CTTAGATTAA 
AAACTTCAGA 
TTTAGTTTTA 
CAAGAATTTG 



TCTGTTTAAT 
TATTTTCATT 
GAATGTTATG 
CAGATGAGGC 
TTCTGAAGTA 
TAGATCCTGA 
CACCCCAGTC 
TTTGCACCTT 
AATTCACAGA 
GTCCTCATTA 
AAAGTCTATG 
ACTTGGAAAA 



TTACACACTT 
TAGTGCTAAA 
GGATGTAAAC 
ACTGGAAACC 
CTTTTTCTTC 
GATGATTTGG 
ACTAGGATGC 
ATTTTCTCTG 
CACTACATAT 
TAAAATGGGA 
ATCTGATCTG 
G 



GCATGAATTA 
CAAGTAAGAA 
AATGTAAAGT 
ACCACCAAAT 
CACAAGAGTG 
TCAGATTGGG 
AGATGGACCA 
TTCCTGAGCC 
CTAAAGCTTT 
AGACTGAGCT 
GACTTCCTAT 



CTGTATATAA 
AAATAAGCTA 
AAAACACTCT 
TAGCAGGTGC 
AATTTGACCT 
ATAAGGCCCA 
CACTTTGAGA 
CCCACATTCT 
GACAAGTCCT 
GGAGTTCAGC 
AATACAAATA 



ACTCCTTAAC 
GAGTGAATTT 
CAGGATTTCA 
ACCTTCTGTG 
AGGCAAGTTT 
GCAATCTGCA 
AACACCACCC 
CTAGGAGAAA 
TGACCTCTAT 
AGTGATGCTT 
CACAATCCTC 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 



15 



Seq ID NO: 187 Protein sequence: 
Protein Accession #: NP_002194.1 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



MGPERTGAAP 
NWLLVGSPWS 
. RNMGTGGFLT 
ESNSIYPWDA 
QTSQYGGDLT 
HDNILRFGIA 
IFSIEGTVQG 
KQAFDQILQD 
AHRGDQIGSY 
QFLEGPEGIE 
SQKILGSDGA 
FTPEKITLVN 
NERCLQKNMV 
FSIPFHKDCG 
IWDFSENLF 
NLQNQASLSF 
VHSFEDVGPK 
ADINPUCIGQ 
NGTFASSTFQ 
AGILLLLALV 



11 
I 

LPLLLVLALS 
GFPENRMGDV 
CGPLWAQQCG 
VKNFLEKFVQ 
NTFGAIQYAR 
VLGYLNRNAL 
GDNFQMEMSQ 
RNHSSYLGYS 
FGSVLCSVDV 
NTRFGSAIAA 
FRSHLQYFGR 
KNAQIILKLC 
VNQAQSCPEH 
EDGLCISDLV 
FASFSLPVDG 
QALSESQEEN 
FIFSLKVTTG 
TSSSVSFKSE 
TVQLTAAAEI 
AILWKLGFFK 



21 
I 

QGILNCCLAY 
YKCPVDLSTA 
NQYYTTGVCS 
GLDIGPTKTQ 
KYAYSAASGG 
DTKNLIKEIK 
VGFSADYSSQ 
VAAISTGEST 
DKDTITDVLL 
LSDINMDGFN 
SLDGYGDLNG 
FSAKFRPTKQ 
IIYIQEPSDV 
LDVRQIPAAQ 
TEVTCQVAAS 
KADNLVNLKI 
SVPVSMATVI 
NFRHTKELNC 
NTYNPEIYVT 
RKYEKMTKNP 



31 
I 

NVGLPEAKIF 
TCEKLNLQTS 
DISPDFQLSA 
VGLIQYANNP 
RRSATKVMW 
AIASIPTERY 
NDILMLGAVG 
HFVAGAPRAN 
VGAPMYMSDL 
DVIVGSPLEN 
DSITDVSIGA 
NNQVAIVYNI 
VNSLDLRVDI 
EQPFIVSNQN 
QKSVACDVGY 
PLLYDAEIHL 
IHIPQYTKEK 
RTASCSNVTC 
EDNTVTIPLM 
DEIDETTELS 



41 

I 

SGPSSEQFGY 
TSIPNVTEMK 
SFSPATQPCP 
RWFNLNTYK 
VTDGESHDGS 
FFNVSDEAAL 
AFGWSGTIVQ 
YTGQIVLYSV 
KKEEGRVYLF 
QNSGAVYIYN 
FGQWQLWSQ 
TLDADGFSSR 
SLENPGTSPA 
KRLTFSVTLK 
PALKREQQVT 
TRSTNINFYE 
NPLMYLTGVQ 
WLKDVHMKGE 
IMKPDEKAEV 
S 



51 
I 

AVQQFINPKG 
TNMSLGLII/T 
SLIDVWVCD 
TKE EMI VATS 
MLKAVIDQCN 
LEKAGTLGEQ 
KTSHGHLIFP 
NENGNITVIQ 
TIKKGILGQH 
GHQGTIRTKY 
SIADVAIEAS 
VTSRGLFKEN 
USAYSETAKV 
NKRESAYNTG 
FTINFDFNLQ 
ISSDGNVPSI 
TDKAGDISCN 
YFVNVTTRIW 
PTGVIIGSII 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 188 DNA sequence 

Nucleic Acid Accession #: NM_002210.1 

Coding sequence: 42-3188 (underlined sequences correspond to start and stop codons) 



GGCTACCGCT 
GACGGCTGCG 
TGTGCCGCGC 
GTTACTTCGG 
TCGTGGGAGC 
TCAAATGTGA 
ATAGAGATTA 
CTGTGAGGTC 
AGATGAAACA 
TTGAGTATGC 
GAGGATTCAG 
TTTATTGGCA 
CCAATGTTTA 
TTGATGACAG 
ATGACTTTGT 
ATGGGAAGAA 
GATTTTCTGT 
CACCTCTCTT 
TGTCTCTACA 
TTGCACGGTT 
ATATTGCAAT 
ATGGAAGATC 
CTCGAAGCAT 
ATGGATATCC 
CCAGACCAGT 
ACAATAAAAC 
TCTGCTTAAA 
TTCTTTTGGA 
GGTCCCCAAG 
AATTGATAGC 
TTTTTATGGA 



11 
I 

CCCGGCTTGG 
CCTCGGTCCC 
CTTCAACCTA 
CTTCGCCGTG 
TCCCAAAGCA 

TGCCAAGGAT 
GAAACAGGAT 
GGAGCGAGAG 
TCCATGTAGA 
CATTGATTTT 
AGGTCAGCTT 
CAGCATCAAG 
CTATTTGGGT 
TTCAGGAGTT 
CATGTCCTCC 
AGCTGCCACT 
CATGGATCGT 
GAGAGCTTCA 
TGGCAGTGCC 
TGCTGCTCCA 
AACAGGCTTG 
GCCACCAAGC 
AGACTTAATT 
TATCACTGTA 
CTGCTCACTG 
GGCAGATGGC 
TAAACTCAAG 
TCACTCCAAG 
GTATCTGCGG 
ATATCGGTTG 



21 
I 

CGTCCCGCGC 
CGCGGCCTCC 
GACGTGGACA 
GATTTCTTCG 
AACACCACCC 
ACCCGCCGGT 
GATCCATTGG 
AAAATTTTGG 
CCTGTTGGAA 
TCACAAGATA 
ACTAAAGCTG 
ATTTCGGATC 
TATAATAACC 
TATTCTGTGG 
CCAAGAGCAG 
TTATACAATT 
GACATTAATG 
GGCTCTGATG 
GGAGACTTCC 
ATAG CTCCTT 
TATGGGGGTG 
AACGCAGTCC 
TTTGGCTATT 
GTAGGAGCTT 
AATGCTGGTC 
CCTGGAACAG 
AAAGGAGTAC 
CAAAAGGGAG 
AACATGACTA 
GATGAATCTG 
GATTATAGAA 



31 
I 

GCACTTCGGC 
CGCTTCTTCT 
GTCCTGCCGA 
TGCCCAGCGC 
AGCCTGGGAT 
GCCAGCCAAT 
AATTTAAGTC 
CCTGTGCCCC 
CATGCTTTCT 
TTGATGCTGA 
ACAGAGTACT 
AAGTGGCAGA 
AATTAGCAAC 
CTGTCGGAGA 
CAAGGACTTT 
TTACTGGCGA 
GAGATGATTA 
GCAAACTCCA 
AGACGACAAA 
TGGGAGATCT 
AAGATAAAAA 
CATCTCAAAT 
CAATGAAAGG 
TTGGTGTAGA 
TTGAAGTGTA 
CTCTCAAAGT 
TTCCCAGGAA 
CAATTCGACG 
TTTCAAGGGG 
AATTTAGAGA 
CAGCTGCTGA 



41 

I 

GATGGCTTTT 
CTCGGGACTC 
GTACTCTGGC 
GTCTTCCCGG 
TGTGGAAGGA 
TGAATTTGAT 
CCATCAGTGG 
ATTGTACCAT 
TCAAGATGGA 
TGGACAGGGA 
TCTTGGTGGT 
AATCGTATCT 
TCGGACTGCA 
TTTCAATGGT 
GGGAATGGTT 
GCAGATGGCT 
TGCAGATGTG 
AGAGGTGGGG 
GCTGAATGGA 
GGACCAGGAT 
AGGAATTGTT 
CCTTGAAGGG 
AGCCACAGAT 
TCGAGCTATC 
CCCTAGCATT 
TTCCTGTTTT 
ACTTAATTTC 
AGCACTGTTT 
GGGACTGATG 
CAAACTCACT 
TACAACAGGC 



51 
I 

CCGCCGCGGC 
CTGCTACCTC 
CCCGAGGGAA 
ATGTTTCTTC 
GGGCAGGTCC 
GCAACAGGCA 
TTTGGAGCAT 
TGGAGAACTG 
ACAAAGACTG 
TTTTGTCAAG 
CCTGGTAGCT 
AAATACGACC 
CAAGCTATTT 
GATGGCATAG 
TATATTTATG 
GCATATTTCG 
TTTATTGGAG 
CAGGTCTCAG 
TTTGAGGTCT 
GGTTTCAATG 
TATATCTTCA 
CAGTGGGCTG 
ATAGACAAAA 
TTATACAGGG 
TTAAATCAAG 
AATGTTAGGT 
CAGGTGGAAC 
CTCTACAGCA 
CAGTGTGAGG 
CCAATTACTA 
TTGCAACCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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TTCTTAACCA GTTCACGCCT GCTAACATTA GTCGACAGGC TCACATTCTA CTTGACTGTG 1920 

GTGAAGACAA TGTCTGTAAA CCCAAGCTGG AAGTTTCTGT AGATAGTGAT CAAAAGAAGA 1980 

TCTATATTGG GGATGACAAC CCTCTGACAT TGATTGTTAA GGCTCAGAAT CAAGGAGAAG 2040 

GTGCCTACGA AGCTGAGCTC ATCGTTTCCA TTCCACTGCA GGCTGATTTC ATCGGGGTTG 2100 

5 TCCGAAACAA TGAAGCCTTA GCAAGACTTT CCTGTGCATT TAAGACAGAA AACCAAACTC 2160 

GCCAGGTGGT ATGTGACCTT GGAAACCCAA TGAAGGCTGG AACTCAACTC TTAGCTGGTC 2220 

TTCGTTTCAG TGTGCACCAG CAGTCAGAGA TGGATACTTC TGTGAAATTT GACTTACAAA 2280 

TCCAAAGCTC AAATCTATTT GACAAAGTAA GCCCAGTTGT ATCTCACAAA GTTGATCTTG 2340 

CTGTTTTAGC TGCAGTTGAG ATAAGAGGAG TCTCGAGTCC TGATCATATC TTTCTTCCGA 2400 

10 TTCCAAACTG GGAGCACAAG GAGAACCCTG AGACTGAAGA AGATGTTGGG CCAGTTGTTC 2460 

AGCACATCTA TGAGCTGAGA AACAATGGTC CAAGTTCATT CAGCAAGGCA ATGCTCCATC 2520 

TTCAGTGGCC TTACAAATAT AATAATAACA CTCTGTTGTA TATCCTTCAT TATGATATTG 2580 

ATGGACCAAT GAACTGCACT TCAGATATGG AGATCAACCC TTTGAGAATT AAGATCTCAT 2640 

CTTTGCAAAC AACTGAAAAG AATGACACGG TTGCCGGGCA AGGTGAGCGG GACCATCTCA 2700 

15 TCACTAAGCG GGATCTTGCC CTCAGTGAAG GAGATATTCA CACTTTGGGT TGTGGAGTTG 2760 

CTCAGTGCTT GAAGATTGTC TGCCAAGTTG GGAGATTAGA CAGAGGAAAG AGTGCAATCT 2820 

TGTACGTAAA GTCATTACTG TGGACTGAGA CTTTTATGAA TAAAGAAAAT CAGAATCATT 2880 

CCTATTCTCT GAAGTCGTCT GCTTCATTTA ATGTCATAGA GTTTCCTTAT AAGAATCTTC 2940 

CAATTGAGGA TATCACCAAC TCCACATTGG TTACCACTAA TGTCACCTGG GGCATTCAGC 3000 

20 CAGCGCCCAT GCCTGTGCCT GTGTGGGTGA TCATTTTAGC AGTTCTAGCA GGATTGTTGC 3060 

TACTGGCTGT TTTGGTATTT GTAATGTACA GGATGGGCTT TTTTAAACGG GTCCGGCCAC 3120 

CTCAAGAAGA ACAAGAAAGG GAGCAGCTTC AACCTCATGA AAATGGTGAA GGAAA CTCA G 3180 

AAACTTAACT GCAGTTTTTA AGTTATGCTA CATCTTGACC CACTAGAATT AGCAACTTTA 3240 

TTATAGATTT AAACTTTCTT CATGAGGAGT AAAAATCCAA GGCTTTACTG CTGATAGTGC 3300 

25 TAATTGGCAT TAACCACAAA ATGAGAATTA TATTTGTCAA CCTTCTCCTT ATAAATAAGT 3360 

TCAGACATAC ATTTAATAAC ATAGGGTGAC TTGTGTTTTT AGGTATTTAA ATAATAAAAT 3420 

TTCAAGGGAT AGTTTTTATT CAATGTATAT AAGACAGGTA GTGCCTGATT TACTACTTTA 3480 

TATAAAATAG TACCTCCTTC AGTTACTGTT TCTGATTTAA TGTACGGAAC TTTATTTGTT 3540 

GTTGTTGTTG TTGTTGTTGT TGTTGTTTTA AAGCAGTCCA AATTTGGACC TTAGCAATCA 3600 

30 TGTCTTTTGT ATAGGTACTT AATGTTAATA CATATTACAC TACAGTTTAC TTTTCAGAAT 3660 

ACTAAAGACT TTATAACTGC ATGAACTTGG ATTTTTTTAA TCACTCATAT GGTAGAATTT 3720 

TATAAACACA TACATGATAC CATCCAAATT CTTGCTTTTA ATAACAAAGG TACAATATTT 3780 

TGTTTTAGTA TGAAAATCTG GTAGATCCTA TTACACTTCT GTTTATATTA AATCCACAAT 3840 

ATTTTATTAC ATTTTTAACT TGTATAAATT TTAGGTCAAA TCCTTCAAGC CAACCTATAC 3900 

35 TAAAAATTAG TTCCATAATC ACAAATGGCT CTTTTGTGTA ATTGTTTAAT TTCACCTGAA 3960 

TATCATAATG CTTAAAGCCA TATGGAGTTG GAAATTATTT CCAAAGCATA TTTATTCCAT 4020 

TGTTTTAGTC TGGCTATTTA CAGTATAAAA AAAGCATTTT ATTAAAATAC TGTGTAGTTC 4080 

TTTGAGATAG TTGCTTATGC ATATAGTAAG TATTACATTC TTAGAGTAGA GCAGAGTTTT 4140 

TAGTTAGTAT TAATTTATTT TCCTCCATTC ATGTACTTTT CCTTATATTT CCAAAACTGT 4200 

40 TACTGAGAAT GGGTCAAGAT CAGTGAGAAA TCTTTACAGT TGACAGGAAC CTGGACCCCT 4260 

TACCCCAACT TTATGAGTAA TGCTTGGAAT AAAAAACTCT TAAGGCAACT CACTGATTTA 4320 

CTTCTAGCAA TAGCATGATG TTACAGGAAT ATTACCTCTG TTTAAGCAAG GTAATGTGTA 4380 

AAATCAGTCT CGGCTGTCAG AATAACTTCT AAAAGGTATT TTTATAAGCA GTTCAAGTTA 444 0 

CTGAAAACCT TTTAAACCTT TCTGAAGTTC GTTAGTATAA ATTACTTTTC TAGGATTATT 4500 

45 AATAAAAGCC ACATAGGTGG CAAGTTGTAG TTTTATATGG CTCTGTAGAG TGGTGAACCT 4560 

TCTAGAGGAA TATATGATTT ATTCACAGTT CCTCAAGGCC TGGGGATGAT GATCAGTTAT 4620 

ACCTATTTTT GTGCAATTAC ATCATGTTGT ACATTAGAAA TGGAGAGTTT AATAGCTCTT 4680 

TAACTGCTGT CCTCATTAGG TAATGATAAA TATTTCCCTT AAATAATTGA CTATTTTGCT 4740 

GTGTTTTAAA AATGATTGAA ATTTATCTTG CCATATCTCA TAATTTCATG CACAAGTTGA 4800 

50 CTGAGCTAAT CTTGAGAATA TATTCGTAAA ATAGGAGCAC ATTTAGTTGA GGTATACAAG 4 860 

GTAGGACTCT AGACAAAACC TTCTATTTTA GCTTTAGTGA ATTTCAAAAG TAATGGGTCT 4920 

TGGAGTATAG ATTTTTATTA GTAGCTTGAA AGAGCTTAAT CATATGCAGT AAGTATTTTT 4980 

ATTACCAATA AATTTAAAAT TTTTTAAGAA AAATATTTTT ATCCTAGGGC CAAGTGTTGC 504 0 

CTGCCACCAA TCAGTAAGTT AGTCTATAAC AAATTTTACC CTAACAGTTT TACCACCTAG 5100 

55 CAACAGTCAT TTCTGAAAAT ATGTTGGATA GAAAGTCACT CTTTGGCAAA AGTGTTAGAA 5160 

TTTGCTTTTG TGCCATCTAT TCCTTTTATG GCATCTATCT TGAAAGTAAT CTTGTATTGG 5220 

AGATTGAAAG ATGCTGTAAT TTAGAAATTA ACATGATATC TTAAATTACC TTTATGAAAT 5280 

ATAGTTTTGT ATAATAGCAT AGATTTTCCT TCAAAAAATG AACATTTATA TATCTACAAA 5340 

AATATGGAGA AGAGCAATTT GAAAGCCTAC TTTCTGAAGA AAATGGTGGG ATTTTTTTTT 5400 

60 ATCATGATTA AATATCAAAA AATTGCCCTA TGAAAACTTT AAATCTCTAA AACATTTGAA 5460 

ATACTACCAT ATTTGTGATT TATTGAGAAT AAAAATCCAT TTTGAAATGT AAAATTTTTA 5520 

TGATCTGATT CAGTTTTAAG AAAACATGAA TGAACTAGAA GATATTAAAA ACATTTGACA 5580 

TTGGTAAGAA ATATTGATAC TGATATTGAT TTTTATATAG GTATTTATTT CAGAATTGAT 5640 

ATTTTGAGAA AAATACATGT GAGTCATTTT TTCTGTTTCT CTTTTCTCTT AACGATTATC 5700 
65 ACTGTAATTC TGAATCT 

Seq ID NO: 189 Protein sequence: 
Protein Accession #: NP_002201.1 

70 1 11 21 31 41 51 

] | I 1 1 I 

MAFPPRRRLR LGPRGLPLIiL SGLLLPLCRA FNLDVDSPAE YSGPEGSYFG FAVDFFVPSA 60 

SSRMFLLVGA PKANTTQPGI VEGGQVLKCD WSSTRRCQPI EFDATGNRDY AKDDPLEFKS 120 

HQWFGASVRS KQDKILACAP LYHWRTEMKQ ERBPVGTCFIj QDGTKTVEYA PCRSQDIDAD 180 

75 GQGFCQGGFS IDFTKADRVL LGGPGSFYWQ GQLISDQVAE IVSKYDPNVY SIKYNNQLAT 240 

RTAQAIFDDS YLGYSVAVGD FNGDGIDDFV SGVPRAARTL GMVYIYDGKN MSSLYNFTGE 300 
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QMAAYFGFSV 
LNGFEVPARF 
LEGQWAARSM 
PSILNQDNKT 
ALFLYSRSPS 
TTGLQPILNQ 
AQNQGEGAYE 
TQLLAGLRFS 
DHIFLPIPNW 
ILHYDIDGPM 
TLGCGVAQCL 
FPYKNLPIED 
FKRVRPPQEE 



AATDINGDDY 
GSAIAPLGDL 
PPSFGYSMKG 
CSLPGTALKV 
HSKNMTISRG 
FTPANISRQA 
AELIVSIPLQ 
VHQQSEMDTS 
EHKENPETEE 
NCTSDMEINP 
KIVCQVGRLD 
ITNSTLVTTN 
QEREQLQPHE 



ADVFIGAPLF 
DQDGFNDIAI 
ATDIDKNGYP 
SCFNVRFCLK 
GLMQCEELIA 
HILLDCGEDN 
ADFIGWRNN 
VKFDLQIQSS 
DVGPWQHIY 
LRIKISSLQT 
RGKSAILYVK 
VTWGIQPAPM 
NGEGNSET 



MDRGSDGKLQ 
AAPYGGEDKK 
DLIVGAFGVD 
ADGKGVIiPRK 
YLRDESEFRD 
VCKPKLEVSV 
EALARLSCAF 
NLFDKVSPW 
ELRNNGPSSF 
TEKNDTVAGQ 
SLLWTETFMN 
PVPVWVIIIiA 



EVGQVSVSLQ 
GIVYIFNGRS 
RAILYRARPV 
LNFQVELLLD 
KLTPITIFME 
DSDQKKIYIG 
KTENQTRQW 
SHKVDIiAVLA 
SKAMLHLQWP 
GERDHLITKR 
KENQNHSYSL 
VLAGLLLLAV 



RASGDFQTTK 
TGLNAVPSQI 
ITVNAGLEVY 
KLKQKGAIRR 
YRLDYRTAAD 
DDNPLTIjIVK 
CDLGNPMKAG 
AVEIRGVSSP 
YKYNNNTLLY 
DLtALSEGDIH 
KSSASFNVIE 
LVFVMYRMGF 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 190 DNA aecruence 
Nucleic Acid Accession #: NMJ)04864 

Coding sequence: 26-952 (underlined sequences correspond to start and stop codons) 



1 
I 

CGGAACGAGG 
TCAGATGCTC 
GGCCGAGGCG 
ATTCCGAGAG 
CTGGGAAGAT 
AGTGCGGCTG 
GGGGCTCCCC 
AAGGTCGTGG 
GCCCGCGCTG 
ATCTTCGTCC 
CCGCAGAGCG 
TCTGCACACG 
ACGGGAGGTG 
CATGCACGCG 
CTGCTGCGTG 
GTCGCTCCAG 
GGTCCTTCCA 
GGGCTCAAGG 
TTATTTATTA 
ACTGTGTATT 
AAAA 



11 
I 

GCAACCTGCA 
CTGGTGTTGC 
AGCCGCGCAA 
TTGCGGAAAC 
TCGAACACCG 
GGATCCGGCG 
GAGGCCTCCC 
GACGTGACAC 
CACCTGCGAC 
GCACGGCCCC 
CGTGCGCGCA 
GTCCGCGCGT 
CAAGTGACCA 
CAGATCAAGA 
CCCGCCAGCT 
ACCTATGATG 
CTGTGCACCT 
TTCCTGAGAC 
TTAATTTATT 
TATTTAAAAC 



21 
I 

CAGCCATGCC 
TGGTGCTCTC 
GTTTCCCGGG 
GCTACGAGGA 
ACCTCGTCCC 
GCCACCTGCA 
GCCTTCACCG 
GACCGCTGCG 
TGTCGCCGCC 
AGCTGGAGTT 
ACGGGGACGA 
CGCTGGAAGA 
TGTGCATCGG 
CGAGCCTGCA 
ACAATCCCAT 
ACTTGTTAGC 
GCGCGGGGGA 
ACCCGATTCC 
GGGGTGACCT 
TCTGGTGATA 



31 
I 

CGGGCAAGAA 
GTGGCTGCCG 
ACCCTCAGAG 
CCTGCTAACC 
GGCCCCTGCA 
CCTGCGTATC 
GGCTCTGTTC 
GCGTCAGCTC 
GCCGTCGCAG 
GCACTTGCGG 
CTGTCCGCTC 
CCTGGGCTGG 
CGCGTGCCCG 
CCGCCTGAAG 
GGTGCTCATT 
CAAAGACTGC 
GGCGACCTCA 
TGCCCAAACA 
TCTTGGGGAC 
AAAATAAAGC 



41 



51 



CTCAGGACGG TGAATGGCTC 
CATGGGGGCG CCCTGTCTCT 
TTGCACTCCG AAGACTCCAG 
AGGCTGCGGG CCAACCAGAG 
GTCCGGATAC TCACGCCAGA 
TCTCGGGCCG CCCTTCCCGA 
CGGCTGTCCC CGACGGCGTC 
AGCCTTGCAA GACCCCAAGC 
TCGGACCAAC TGCTGGCAGA 
CCGCAAGCCG CCAGGGGGCG 
GGGCCCGGGC GTTGCTGCCG 
GCCGATTGGG TGCTGTCGCC 
AGCCAGTTCC GGGCGGCAAA 
CCCGACACGG AGCCAGCGCC 
CAAAAGACCG ACACCGGGGT 
CACTGCAT AT GA GCAGTCCT 
GTTGTCCTGC CCTGTGGAAT 
GCTGTATTTA TATAAGTCTG 
TCGGGGGCTG GTCTGATGGA 
TGTCTGAACT GTTAAAAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



Seq ID NO: 191 Protein sequence: 
Protein Accession #: NP_004855 



MPGQELRTVN 
EDLLTRLRAN 
HRALFRLSPT 
ELHIiRPQAAR 
IGACPSQFRA 
LAKDCHCI 



11 

I 

GSQMLLVLLV 
QSWEDSNTDL 
ASRSWDVTRP 
GRRRARARNG 
ANMHAQIKTS 



21 
I 

LSWLPHGGAL 
VPAPAVRILT 
LRRQLSLARP 
DDCPIiGPGRC 
LHRLKPDTEP 



31 41 51 

I I I 

SLAEASRASF PGPSELHSED SRFRELRKRY 60 

PEVRLGSGGH LHLRISRAAL PEGLPEASRL 120 

QAPALHLRLS PPPSQSDQLL AESSSARPQL 180 

CRLHTVRASIj EDLGWADWVL SPREVQVTMC 240 

APCCVPASYN PMVLIQKTDT GVSLQTYDDL 300 



Seq ID NO: 192 DNA sequence 

Nucleic Acid Accession #: XM_061731.1 

Coding sequence: 1-567 (underlined sequences correspond to start and stop codons) 



1 
I 

ATGAGAAAAG 
AGAAAGTTTC 
GTAAAAGAAC 
CTGTTGTTGA 
TCCCCTCCAA 
TGGGGTGGCA 
AGTCTTTCAC 
GAGGTCTTCC 
GCTGGATGGT 
ATGGTTTATT 



11 

I 

GAAATGAGGG 
TCAAAGAAGA 
CTTTCTCTCT 
TGTCCACAGA 
TGTGCACCAA 
AAGACACCAG 
TCACCAAACA 
AGCCACTTTC 
GGATTTATCA 
CTAAAGAAAC 



21 
I 

AGAGAACACA 
TGGCATTACA 
GATTGGACTT 
CACTGGCAAG 
ATCACGTAAA 
GAGCAATACT 
TTCCCACAAG 
AGAGCCAGGT 
GAGCTGTCAG 
TGAG TGA 



31 
I 

GAAGAGGGCA 
TTGCACATCT 
GACACACAGA 
GACAGGTTTA 
AATGGGGATA 
GATCTTCCTA 
CCTGTCCCTG 
GTAGAAGCAG 
GTTCCTTCCT 



Seq ID NO: 193 Protein sequence: 
Protein Accession #: XP_061731.1 



11 



21 



31 
I 



41 
I 

GGCTTGCTCA 
CTCTGTGTCT 
AGGATCTCAG 
CCAACATACT 
ATGACTCCCC 
TCAGAGACCC 
AGCATCAGTG 
AGATGGAAGT 
CAACCCTTGC 



41 
I 



51 
I 

GCTTGCTCAA 
CTCTATTGCT 
TAAAGATTTG 
GCTGTCACAC 
TGCCTTCACA 
TGGGGGCAAG 
TGACCAGAGA 
GTTCGCTGAT 
AAGAAAGAAG 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 



270 



WO 02/079492 



PCTYUS02/04915 



MRKGNEGENT EEGRLAQLAQ RKPLKEDGIT LHISLCLSIA VKBPPSLIGL DTQKDLSKDL 60 

LUjMSTDTGK DRFTNILLSH SPPMCTKSRK NGDNDSPAFT WGGKDTRSNT DLPIRDPGGK 120 

SLSLTKHSHK PVPEHQCDQR EVFQPLSEPG VEAEMEVFAD AGWWIYQSCQ VPSSTLARKK 180 
^ MVYSKETE 

Seq ID NO: 194 DNA sequence 

Nucleic Acid Accession #: NM_005415.2 

Coding sequence: 371-2410 {underlined sequences correspond to start and stop codons) 

10 1 11 21 31 41 SI 

I I I I I I 

GAGCTGTCCC CGGTGCCGCC GACCCGGGCC GTGCCGTGTG CCOGTGGCTC CAGCCGCTGC 60 

CGCCTCGATC TCCTCGTCTC CCGCTCCGCC CTCCCTTTTC CCTGGATGAA CTTGCGTCCT 120 

TTCTCTTCTC CGCCATGGAA TTCTGCTCCG TGCTTTTAGC CCTCCTGAGC CAAAGAAACC 180 

15 CCAGACAACA GATGCCCATA CGCAGCGTAT AGCAGTAACT CCCCAGCTCG GTTTCTGTGC 240 

CGTAGTTTAC AGTATTTAAT TTTATATAAT ATATATTATT TATTATAGCA TTTTTGATAC 300 

CTCATATTCT GTTTACACAT CTTGAAAGGC GCTCAGTAGT TCTCTTACTA AACAACCACT 360 

ACTCCAGAGA ATGGCAACGC TGATTACCAG TACTACAGCT GCTACCGCCG CTTCTGGTCC 420* 

TTTGGTGGAC TACCTATGGA TGCTCATCCT GGGCTTCATT ATTGCATTTG TCTTGGCATT 480 

20 CTCCGTGGGA GCCAATGATG TAGCAAATTC TTTTGGTACA GCTGTGGGCT CAGGTGTAGT 540 

GACCCTGAAG CAAGCCTGCA TCCTAGCTAG CATCTTTGAA ACAGTGGGCT CTGTCTTACT 600 

GGGGGCCAAA GTGAGCGAAA CCATCCGGAA GGGCTTGATT GACGTGGAGA TGTACAACTC 660 

GACTCAAGGG CTACTGATGG CCGGCTCAGT CAGTGCTATG TTTGGTTCTG CTGTGTGGCA 720 

ACTCGTGGCT TCGTTTTTGA AGCTCCCTAT TTCTGGAACC CATTGTATTG TTGGTGCAAC 780 

25 TATTGGTTTC TCCCTCGTGG CAAAGGGGCA GGAGGGTGTC AAGTGGTCTG AACTGATAAA 840 

AATTGTGATG TCTTGGTTCG TGTCCCCACT GCTTTCTGGA ATTATGTCTG GAATTTTATT 900 

CTTCCTGGTT CGTGCATTCA TCCTCCATAA GGCAGATCCA GTTCCTAATG GTTTGCGAGC 960 

TTTGCCAGTT TTCTATGCCT GCACAGTTGG AATAAACCTC TTTTCCATCA TGTATACTGG 1020 

AGCACCGTTG CTGGGCTTTG ACAAACTTCC TCTGTGGGGT ACCATCCTCA TCTCGGTGGG 1080 

30 ATGTGCAGTT TTCTGTGCCC TTATCGTCTG GTTCTTTGTA TGTCCCAGGA TGAAGAGAAA 1140 

AATTGAACGA GAAATAAAGT GTAGTCCTTC TGAAAGCCCC TTAATGGAAA AAAAGAATAG 1200 

CTTGAAAGAA GACCATGAAG AAACAAAGTT GTCTGTTGGT GATATTGAAA ACAAGCATCC 1260 

TGTTTCTGAG GTAGGGCCTG CCACTGTGCC CCTCCAGGCT GTGGTGGAGG AGAGAACAGT 1320 

CTCATTCAAA CTTGGAGATT TGGAGGAAGC TCCAGAGAGA GAGAGGCTTC CCAGCGTGGA 1380 

35 CTTGAAAGAG GAAACCAGCA TAGATAGCAC CGTGAATGGT GCAGTGCAGT TGCCTAATGG 1440 

GAACCTTGTC CAGTTCAGTC AAGCCGTCAG CAACCAAATA AACTCCAGTG GCCACTCCCA 1500 

GTATCACACC GTGCATAAGG ATTCCGGCCT GTACAAAGAG CTACTCCATA AATTACATCT 1560 

TGCCAAGGTG GGAGATTGCA TGGGAGACTC CGGTGACAAA CCCTTAAGGC GCAATAATAG 1620 

CTATACTTCC TATACCATGG CAATATGTGG CATGCCTCTG GATTCATTCC GTGCCAAAGA 1680 

40 AGGTGAACAG AAGGGCGAAG AAATGGAGAA GCTGACATGG CCTAATGCAG ACTCCAAGAA 1740 

GCGAATTCGA ATGGACAGTT ACACCAGTTA CTGCAATGCT GTGTCTGACC TTCACTCAGC 1800 

ATCTGAGATA GACATGAGTG TCAAGGCAGC GATGGGTCTA GGTGACAGAA AAGGAAGTAA 1860 

TGGCTCTCTA GAAGAATGGT ATGACCAGGA TAAGCCTGAA GTCTCTCTCC TCTTCCAGTT 1920 

CCTGCAGATC CTTACAGCCT GCTTTGGGTC ATTCGCCCAT GGTGGCAATG ACGTAAGCAA I960 

45 TGCCATTGGG CCTCTGGTTG CTTTATATTT GGTTTATGAC ACAGGAGATG TTTCTTCAAA 2040 

AGTGGCAACA CCAATATGGC TTCTACTCTA TGGTGGTGTT GGTATCTGTG TTGGTCTGTG 2100 

GGTTTGGGGA AGAAGAGTTA TCCAGACCAT GGGGAAGGAT CTGACACCGA TCACACCCTC 2160 

TAGTGGCTTC AGTATTGAAC TGGCATCTGC CCTCACTGTG GTGATTGCAT CAAATATTGG 2220 

CCTTCCCATC AGTACAACAC ATTGTAAAGT GGGCTCTGTT GTGTCTGTTG GCTGGCTCCG 2280 

50 GTCCAAGAAG GCTGTTGACT GGCGTCTCTT TCGTAACATT TTTATGGCCT GGTTTGTCAC 2340 

AGTCCCCATT TCTGGAGTTA TCAGTGCTGC CATCATGGCA ATCTTCAGAT ATGTCATCCT 2400 

CAGAATGTGA AGCTGTTTGA GATTAAAATT TGTGTCAATG TTTGGGACCA TCTTAGGTAT 2460 

TCCTGCTCCC CTGAAGAATG ATTACAGTGT TAACAGAAGA CTGACAAGAG TCTTTTTATT 2520 

TGGGAGCAGA GGAGGGAAGT GTTACTTGTG CTATAACTGC TTTTGTGCTA AATATGAATT 2580 

55 GTCTCAAAAT TAGCTGTGTA AAATAGCCCG GGTTCCACTG GCTCCTGCTG AGGTCCCCTT 2640 

TCCTTCTGGG CTGTGAATTC CTGTACATAT TTCTCTACTT TTTGTATCAG GCTTCAATTC 2700 

CATTATGTTT TAATGTTGTC TCTGAAGATG ACTTGTGATT TTTTTTTCTT TTTTTTAAAC 2760 

CATGAAGAGC CGTTTGACAG AGCATGCTCT GCGTTGTTGG TTTCACCAGC TTCTGCCCTC 2820 

ACATGCACAG GGATTTAACA ACAAAAATAT AACTACAACT TCCCTTGTAG TCTCTTATAT 2B80 

60 AAGTAGAGTC CTTGGTACTC TGCCCTCCTG TCAGTAGTGG CAGGATCTAT TGGCATATTC 2940 

GGGAGCTTCT TAGAGGGATG AGGTTCTTTG AACACAGTGA AAATTTAAAT TAGTAACTTT 3000 

TTTGCAAGCA GTTTATTGAC TGTTATTGCT AAGAAGAAGT AAGAAAGAAA AAGCCTGTTG 3060 

GCAATCTTGG TTATTTCTTT AAGATTTCTG GCAGTGTGGG ATGGATGAAT GAAGTGGAAT 3120 

GTGAACTTTG GGCAAGTTAA ATGGGACAGC CTTCCATGTT CATTTGTCTA CCTCTTAACT 3180 

65 GAATAAAAAA GCCTACAGTT TTTAGAAAAA ACCCGAATTC 

Seq ID NO: 195 Protein sequence; 
Protein Accession #: NP_005406.2 

70 1 11 21 31 41 51 

I I I 1 I I 

MATLITSTTA ATAASGPLVD YLWMLILGFI IAFVLAFSVG ANDVANSFGT AVGSGWTLK 60 

QACILASIFE TVGSVLLGAK VSETIRKGLI DVEMYNSTQG liLMAGSVSAM FGSAVWQLVA 120 

SFLKLPISGT HCIVGATIGF SLVAKGQEGV KWSELIKIVM SWFVSPLLSG IMSGILFFLV 180 

75 . RAFILHKADP VPNGLRALPV FYACTVGINL FSIMYTGAPI* LGFDKLPLWG TILISVGCAV 240 

FCAIiIVWFFV CPRMKRKIER EIKCSPSESP LMEKKNSLKE DHEETKLSVG DIENKHPVSE 300 



271 
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VGPATVPLQA WEERTVSFK LGDLEEAPER ERLPSVDLKE ETSIDSTVNG AVQLPNGNLV 360 

QPSQAVSNQI NSSGHSQYHT VHKDSGLYKE LLHKLHLAKV GDCMGDSGDK PLRRNNSYTS 420 

YTMAICGMPL DSFRAKEGEQ KGEEMBKLTW PNADSKKRIR MDSYTSYCNA VSDLHSASEI 480 

DMSVKAAMGL GDRKGSNGSL EEWYDQDKPE VSLLFQFIiQI LTACFGSFAH GGNDVSNAIG 540 

PIiVALYLVYD TGDVSSKVAT PIWLLLYGGV GICVGLWVWG RRVIQTMGKD LTPITPSSGP 600 

SIELASALTV VIASNIGLPI STTHCKVGSV VSVGWLRSKK AVDWRIiFRNI FMAWFVTVPI 660 
SGVTSAAIMA IFRYVILRM 



Seq ID NO: 196 DNA sequence 

Nucleic Acid Accession #: NMJ)00020.1 

Coding sequence: 283-1794 (underlined sequences correspond to start and stop codons) 



AGGAAACGGT 
AGAAACATTT 
GAGCGAGCCC 
CCAGCGCTGG 
AGGCTAGCGC 
AGGAAAGGCC 
TCTCGGGGCC 
CGGGGGGCCT 
CGGGGCTGCG 
CACTACTGCT 
CAACCTCCTT 
CTGGCCTTGC 
CAGGAGAAGC 
TCTGAGCAGG 
GGCTCAGGGC 
TGTGTGGGAA 
GCCGTCAAGA 
AACACAGTAT 
CGCAACTCGA 
GACTTTCTGC 
GCATGCGGCC 
GCCCACCGCG 
GCCGACCTGG 
AACCCGAGAG 
ACGGACTGCT 
GAGATTGCCC 
GATGTGGTGC 
CAGACCCCCA 
ATGATGCGGG 
AAGACACTAC 
AGCACCTGAT 
CTATCTGGGT 
TGCTCGGCCC 



11 
I 

TTATTAGGAG 
TTGCTCCAGC 
CTCCCCGGCT 
CGGTGCAACT 
CCCGCCACCC 
TTCTGATGCT 
CGCTGGTGAC 
GGTGCACAGT 
GGAACTTGCA 
GCGACAGCCA 
CGGAGCAGCC 
TGGCCCTGGT 
AGCGTGGCCT 
GCGACACGAT 
TCCCCTTCCT 
AAGGCCGCTA 
TCTTCTCCTC 
TGCTCAGACA 
GCACGCAGCT 
AGAGACAGAC 
TGGCGCACCT 
ACTTCAAGAG 
GCCTGGCTGT 
TGGGCACCAA 
TTGAGTCCTA 
GCCGGACCAT 
CCAATGACCC 
CCATCCCTAA 
AGTGCTGGTA 
AAAAAATTAG 
TCCTTTCTGC 
AGAGGTAGTG 
CCAGCCCACC 



21 
I 

GGAGTGGTGG 
CCCCATCCCA 
CCAGCCCGGT 
GCGGCCGCGC 
GCAGAGCGGG 
GCTGATGGCC 
CTGCACGTGT 
AGTGCTGGTG 
CAGGGAGCTC 
CCTCTGCAAC 
GGGAACAGAT 
GGCCCTGGGT 
GCACAGCGAG 
GTTGGGGGAC 
GGTGCAGAGG 
TGGCGAAGTG 
GAGGGATGAA 
CGACAACATC 
GTGGCTCATC 
GCTGGAGCCC 
GCACGTGGAG 
CCGCAATGTG 
GATGCACTCA 
GCGGTACATG 
CAAGTGGACT 
CGTGAATGGC 
CAGCTTTGAG 
CCGGCTGGCT 
CCCAAACCCC 
CAACAGTCCA 
CTGCAGGGGG 
TGAGTGTGGT 
CAGCCAAAAA 



31 
I 

AGCTGGGCCA 
GTCCCGGGAG 
CCGGGGCCGC 
GGTGGAGGGG 
CCCAGAGGGA 
TTGGTGACCC 
GAGAGCCCAC 
CGGGAGGAGG 
TGCAGGGGGC 
CACAACGTGT 
GGCCAGCTGG 
GTCCTGGGCC 
CTGGGAGAGT 
CTCCTGGACA 
ACAGTGGCAC 
TGGCGGGGCT 
CAGTCCTGGT 
CTAGGCTTCA 
ACGCACTACC 
CATCTGGCTC 
ATCTTCGGTA 
CTGGTCAAGA 
CAGGGCAGCG 
GCACCCGAGG 
GACATCTGGG 
ATCGTGGAGG 
GACATGAAGA 
GCAGACCCGG 
TCTGCCCGAC 
GAGAAGCCTA 
CTGGGGGGGT 
GTGTGCTGGG 
TACAGCTGGG 



41 
I 

GGCAGGAAGA 
GCTGCCGCGC 
GCCGGACCCC 
AGGTGGCCCC 
CCATGACCTT 
AGGGAGACCC 
ATTGCAAGGG 
GGAGGCACCC 
GCCCCACCGA 
CCCTGGTGCT 
CCCTGATCCT 
TGTGGCATGT 
CCAGTCTCAT 
GTGACTGCAC 
GGCAGGTTGC 
TGTGGCACGG 
TCCGGGAGAC 
TCGCCTCAGA 
ACGAGCACGG 
TGAGGCTAGC 
CACAGGGCAA 
GCAACCTGCA 
ATTACCTGGA 
TGCTGGACGA 
CCTTTGGCCT 
ACTATAGACC 
AGGTGGTGTG 
TCCTCTCAGG 
TCACCGCGCT 
AAGTGATTCA 
GGGGGGCAGT 
GATGGGCAGC 
CTGAAACCTG 



51 
I 

CGCTGGAATA 
CAGCTGCGCC 
AGCCCGCCGT 
GGTCCGCCGA 
GGGCTCCCCC 
TGTGAAGCCG 
GCCTACCTGC 
CCAGGAACAT 
GTTCGTCAAC 
GGAGGCCACC 
GGGCCCCGTG 
CCGACGGAGG 
CCTGAAAGCA 
CACAGGGAGT 
CTTGGTGGAG 
TGAGAGTGTG 
TGAGATCTAT 
CATGACCTCC 
CTCCCTCTAC 
TGTGTCCGCG 
ACCAGCCATT 
GTGTTGCATC 
CATCGGCAAC 
GCAGATCCGC 
GGTGCTGTGG 
ACCCTTCTAT 
TGTGGATCAG 
CCTAGCTCAG 
GCGGATCAAG 
ATAGCCCAGG 
GGATGGTGCC 
TGCGCCTGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



Seq ID NO: 197 Protein sequence: 
Protein Accession #: NP_0 00 011.1 



1 11 21 

i I I 

MTLGSPRKGL LMLLMALVTQ GDPVKPSRGP 
RHPQEHRGCG NLRRELCRGR PTEFVNHYCC 
LILGPVLALL ALVALGVLGL WHVRRRQEKQ 
DCTTGSGSGL PFLVQRTVAR QVALVECVGK 
RETEIYNTVL LRHDNILGFI ASDMTSRNSS 
RLAVSAACGL AHLHVEIFGT QGKPAIAHRD 
YLDIGNNPRV GTKRYMAPEV LDEQIRTDCF 
YRPPFYDWP NDPSFEDMKK WCVDQQTPT 
TAIiRlKKTLQ KISNSPEKPK VIQ 



31 41 51 

I I I 

LVTCTCESPH CKGPTCRGAW CTWLVREEG 60 

DSHLCNHNVS LVLEATQPPS EQPGTDGQLA 120 

RGLHSELGES SLILKASEQG DTMLGDLLDS 180 

GRYGEVWRGL WHGESVAVKI FSSRDEQSWF 240 

TQLWLITHYH EHGSLYDFLQ RQTLEPHLAL 300 

FKSRNVLVKS NLQCCIADLG LAVMHSQGSD 360 

ESYKWTDIWA FGLVLWEIAR RTIVNGIVED 420 

IPNRLAADPV LSGLAQMMRE CWYPNPSARL 480 



Seq ID NO: 198 DNA sequence 

Nucleic Acid Accession #: NMJ)03199.1 

Coding sequence: 200-2203 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

l l l l I I 

CGGGGGGATC TTGGCTGTGT GTCTGCGGAT CTGTAGTGGC GGCGGCGGCG GCGGCGGCGG 60 

GGAGGCAGCA GGCGCGGGAG CGGGCGCAGG AGCAGGCGGC GGCGGTGGCG GCGGCGGTTA 120 

GACATGAACG CCGCCTCGGC GCCGGCGGTG CACGGAGAGC CCCTTCTCGC GCGCGGGCGG 180 

TTTGTGTGAT TTTGCTAA AA TG CATCACCA ACAGCGAATG GCTGCCTTAG GGACGGACAA 240 

AGAGCTGAGT GATTTACTGG ATTTCAGTGC GATGTTTTCA CCTCCTGTGA GCAGTGGGAA 300 

AAATGGACCA ACTTCTTTGG CAAGTGGACA TTTTACTGGC TCAAATGTAG AAGACAGAAG 360 

TAGCTCAGGG TCCTGGGGGA ATGGAGGACA TCCAAGCCCG TCCAGGAACT ATGGAGATGG 420 
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GACTCCCTAT GACCACATGA CCAGCAGGGA CCTTGGGTCA CATGACAATC TCTCTCCACC 480 

TTTTGTCAAT TCCAGAATAC AAAGTAAAAC AGAAAGGGGC TCATACTCAT CTTATGGGAG 540 

AGAATCAAAC TTACAGGGTT GCCACCAGCA GAGTCTCCTT GGAGGTGACA TGGATATGGG 600 

CAACCCAGGA ACCCTTTCGC CCACCAAACC TGGTTCCCAG TACTATCAGT ATTCTAGCAA 660 

TAATCCCCGA AGGAGGCCTC TTCACAGTAG TGCCATGGAG GTACAGACAA AGAAAGTTCG 720 

AAAAGTTCCT CCAGGTTTGC CATCTTCAGT CTATGCTCCA TCAGCAAGCA CTGCCGACTA 780 

CAATAGGGAC TCGCCAGGCT ATCCTTCCTC CAAACCAGCA ACCAGCACTT TCCCTAGCTC 840 

CTTCTTCATG CAAGATGGCC ATCACAGCAG TGACCCTTGG AGCTCCTCCA GTGGGATGAA 900 

TCAGCCTGGC TATGCAGGAA TGTTGGGCAA CTCTTCTCAT ATTCCACAGT CCAGCAGCTA 960 

CTGTAGCCTG CATCCACATG AACGTTTGAG CTATCCATCA CACTCCTCAG CAGACATCAA 1020 

TTCCAGTCTT CCTCCGATGT CCACTTTCCA TCGTAGTGGT ACAAACCATT ACAGCACCTC 1080 

TTCCTGTACG CCTCCTGCCA ACGGGACAGA CAGTATAATG GCAAATAGAG GAAGCGGGGC 1140 

AGCCGGCAGC TCCCAGACTG GAGATGCTCT GGGGAAAGCA CTTGCTTCGA TCTATTCTCC 1200 

AGATCACACT AACAACAGCT TTTCATCAAA CCCTTCAACT CCTGTTGGCT CTCCTCCATC 1260 

TCTCTCAGCA GGCACAGCTG TTTGGTCTAG AAATGGAGGA CAGGCCTCAT CGTCTCCTAA 1320 

TTATGAAGGA CCCTTACACT CTTTGCAAAG CCGAATTGAA GATCGTTTAG AAAGACTGGA 1380 

TGATGCTATT CATGTTCTCC GGAACCATGC AGTGGGCCCA TCCACAGCTA TGCCTGGTGG 1440 

TCATGGGGAC ATGCATGGAA TCATTGGACC TTCTCATAAT GGAGCCATGG GTGGTCTGGG 1500 

CTCAGGGTAT GGAACCGGCC TTCTTTCAGC CAACAGACAT TCACTCATGG TGGGGACCCA 1560 

TCGTGAAGAT GGCGTGGCCC TGAGAGGCAG CCATTCTCTT CTGCCAAACC AGGTTCCGGT 1620 

TCCACAGCTT CCTGTCCAGT CTGCGACTTC CCCTGACCTG AACCCACCCC AGGACCCTTA 1680 

CAGAGGCATG CCACCAGGAC TACAGGGGCA GAGTGTCTCC TCTGGCAGCT CTGAGATCAA 1740 

ATCCGATGAC GAGGGTGATG AGAACCTGCA AGACACGAAA TCTTCGGAGG ACAAGAAATT 1800 

AGATGACGAC AAGAAGGATA TCAAATCAAT TACTAGCAAT AATGACGATG AGGACCTGAC 1860 

ACCAGAGCAG AAGGCAGAGC GTGAGAAGGA GCGGAGGATG GCCAACAATG CCCGAGAGCG 1920 

TCTGCGGGTC CGTGACATCA ACGAGGCTTT CAAAGAGCTC GGCCGCATGG TGCAGCTCCA 1980 

CCTCAAGAGT GACAAGCCCC AGACCAAGCT CCTGATCCTC CACCAGGCGG TGGCCGTCAT 2040 

CCTCAGTCTG GAGCAGCAAG TCCGAGAAAG GAATCTGAAT CCGAAAGCTG CGTGTCTGAA 2100 

AAGAAGGGAG GAAGAGAAGG TGTCCTCGGA GCCTCCCCCT CTCTCCTTGG CCGGCCCACA 2160 

CCCTGGAATG GGAGACGCAT CGAATCACAT GGGACAGATG TAAAAGGGTC CAAGTTGCCA 2220 

CATTGCTTCA TTAAAACAAG AGACCACTTC CTTAACAGCT GTATTATCTT AAACCCACAT 2280 

AAACACTTCT CCTTAACCCC CATTTTTGTA ATATAAGACA AGTCTGAGTA GTTATGAATC 2340 

GCAGACGCAA GAGGTTTCAG CATTCCCAAT TATCAAAAAA CAGAAAAACA AAAAAAAGAA 2400 

AGAAAAAAGT GCAACTTGAG GGACGACTTT CTTTAACATA TCATTCAGAA TGTGCAAAGC 2460 
AGTATGTACA GGCTGAGACA CAGCCCAGAG ACTGAACGGC 



Seq ID NO: 199 Protein sequence: 
Protein Accession #; NP_ 003190.1 

I 11 21 31 41 51 

II I I I I 

MHHQQRMAAIi GTDKELSDLL DFSAMFSPPV SSGKNGPTSL ASGHFTGSNV EDRSSSGSWG 60 

NGGHPSPSRN YGDGTPYDHM TSRDLGSHDN LSPPFVNSRI QSKTERGSYS SYGRESNLQG 120 

CHQQSLLGGD MDMGNPGTLS PTKPGSQYYQ YSSNNPRRRP IiHSSAMEVQT KKVRKVPPGL 180 

PSSVYAPSAS TADYNRDSPG YPSSKPATST FPSSFFMQDG HHSSDPWSSS SGMNQPGYAG 240 

MLGNSSHIPQ SSSYCSLHPH ERLSYPSHSS ADINSSLPPM STFHRSGTNH YSTSSCTPPA 300 

NGTDSIMANR GSGAAGSSQT GDALGKALAS IYSPDHTNNS FSSWPSTPVG SPPSLSAGTA 360 

VWSRNGGQAS SSPNYEGPLH SLQSRIEDRL ERLDDAIHVL RNHAVGPSTA MPGGHGDMHG 420 

IIGPSHNGAM GGLGSGYGTG LLSANRHSLM VGTHREDGVA LRGSHSLLPN QVPVPQLPVQ 480 

SATSPDLNPP QDPYRGMPPG LQGQSVSSGS SEIKSDDEGD ENLQDTKSSE DKKLDDDKKD 540 

IKSITSNNDD EDLTPEQKAE REKERRMANN ARERLRVRDI NEAFKELGRM VQLHLKSDKP 600 

QTKLLILHQA VAVILSLEQQ VRERNLNPKA ACLKRREEEK VSSEPPPLSL AGPHPGMGDA 660 
SNHMGQM 



Seq ID NO: 200 DNA sequence 

Nucleic Acid Accession #: BC005987 (1-1286), BE888744 (1287-1756) 

Coding sequence: 124-525 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGCAGAAGAG GAAGATTTCT GAAGAGTGCA GCTGCCTGAA CCGAGCCCTG CCGAACAGCT 60 

GAGAATTGCA CTGCAAC CAT GA GTGAGAAC AATAAGAATT CCTTGGAGAG CAGCCTACGG 120 

CAACTAAAAT GCCATTTCAC CTGGAACTTG ATGGAGGGAG AAAACTCCTT GGATGATTTT 180 

GAAGACAAAG TATTTTACCG GACTGAGTTT CAGAATCGTG AATTCAAAGC CACAATGTGC 240 

AACCTACTGG CCTATCTAAA GCACCTCAAA GGGCAAAACG AGGCAGCCCT GGAATGCTTA 300 

CGTAAAGCTG AAGAGTTAAT CCAGCAAGAG CATGCTGACC AGGCAGAAAT CAGAAGTCTG 360 

GTCACCTGGG GAAACTATGC CTGGGTCTAC TATCACATGG GCCGACTCTC AGACGTTCAG 420 

ATTTATGTAG ACAAGGTGAA ACATGTCTGT GAGAAGTTTT CCAGTCCCTA TAGAATTGAG 480 

AGTCCAGAGC TTGACTGTGA GGAAGGGTGG ACACGGTTAA AGTGTGGARG AAACCAAAAT 540 

GAAAGAGCGA AGGTGTGCTT TGAGAAGGCT CTGGAAAAGA AGCCAAAGAA CCCAGAATTC 600 

ACCTCTGGAC TGGCAATAGC AAGCTACCGT CTGGACAACT GGCCACCATC TCAGAACGCC 660 

ATTGACCCTC TGAGGCAAGC CATTCGGCTG AATCCTGACA ACCAGTACCT TAAAGTCCTC 720 

CTGGCTCTGA AGCTTCATAA GATGCGTGAA GAAGGTGAAG AGGAAGGTGA AGGAGAGAAG 780 

TTAGTTGAAG AAGCCTTGGA GAAAGCCCCA GGTGTAACAG ATGTACTTCG CAGTGCAGCC 840 

AAGTTTTATC GAAGAAAAGA TGAGCCAGAC AAAGCGATTG AACTGCTTAA AAAGGCTTTA 900 

GAATACATAC CAAACAATGC CTACCTGCAT TGCCAAATTG GGTGCTGCTA TAGGGCAAAA 960 
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10 



15 



GTCTTCCAAG 
CTAATAGGAC 
CGTGTCTGTT 
TATTACTTCC 
CTGCGGTATG 
TTTATAGAGQ 
CTGCAAAAAA 
GTCTTGGCAT 
AGGGGTTTGG 
ATAGAGATGT 
GGTATTCAAA 
CACTGGACAG 
GGGAGAGGGA 
CTCTTTG CGG 



TAATGAATCT 
ACGCTGTGGC 
CCATTCTTGC 
AAAAGGAATT 
GCAACTTTCA 
GTGTAAAAAT 
TTGCCAAAAT 
TCCTTCAGGA 
AGTCTGGAAG 
GGTGCCCACT 
ATATGTAATG 
GGTTATGTTA 
CAGATTGGGG 
AACTTC 



AAGAGAGAAT 
TCATCTGAAG 
CAGCCTCCAT 
CAGTAAAGAG 
GCTGTACCAA 
AAACCAGAAA 
GCGACTTTCT 
GCTGAATGAA 
CCTCATCCCT 
AGGCTACTGC 
ACTGGTATGG 
AACCTGAATT 
GGTCGTCCAG 



GGAATGTATG 
AAAGCTGATG 
GCTCTAGCAG 
CTTACTCCTG 
ATGAAGTGTG 
TCAAGGGAGA 
AAAAATGGAG 
AAAATGCAAC 
TCAGCATCAA 
TGAAAGGGAG 
CAAAAGATTG 
GCTGGGTCTT 
GGCTGCGCTA 



GGAAAAGAAA 
AGGCCAATGA 
ATCAGTATGA 
TAGCGAAACA 
AAGACAAGGC 
AAGAAAAGAT 
CAGATTCTGA 
AAGCAGATGA 
GCTGGAATGG 
CTGAAATTCC 
GACTAAGACA 
AAAAGAGCCC 
AATTATTCTC 



GTTACTGGAA 
TAATCTCTTC 
AGAAGCAGAG 
ACTGCTCCAT 
CATCCACCAC 
GAAAGACAAA 
GGCTTTGCAT 
AGACTCTGAG 
GGAATGAAGA 
TCCACAAGTT 
CTGGCCATAC 
AAGGAGTTCT 
AATGATTTGT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 201 Protein sequence: 
Protein Accession #: AAA59191 



20 



25 



30 



35 



40 



45 



50 



55 



60. 



65 



i 
I 

MSENNKNSLE 
KHLKGQNEAA 
KHVCEKFSSP 
ASYRLDNWPP 
EKAPGVTDVL 
LRENGMYGKR 
FSKELTPVAK 
MRLSKNGADS 



11 
I 

SSLRQLKCHF 
LECLRKAEEL 
YRIESPELDC 
SQNAIDPLRQ 
RSAAKFYRRK 
KLLELIGHAV 
QLLHLRYGNF 
EALHVLAFLQ 



21 
I 

TWNLMEGENS 
IQQEHADQAE 
EEGWTRLKCG 
AIRLNPDNQY 
DEPDKAIELL 
AHLKKADEAN 
QLYQMKCEDK 
ELNEKMQQAD 



31 
I 

LDDFEDKVFY 
IRSLVTWGNY 
GNQNERAKVC 
LKVLLALKLH 
KKALEYIPNN 
DNLFRVCSIL 
AIHHFIEGVK 
EDSERGLESG 



41 

I 

RTEFQNREFK 
AWVYYHMGRL 
FEKALEKKPK 
KMREEGEEEG 
AYLHCQIGCC 
ASLHALADQY 
INQKSREKEK 
SLIPSASSWN 



51 
I 

ATMCNLLAYL 
SDVQIYVDKV 
NPEFTSGLAI 
EGEKLVEEAL 
YRAKVFQVMN 
EDAEYYFQKE 
MKDKLQKIAK 
GE 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 202 DNA sequence 
Nucleic Acid Accession #: NM_003090 

Coding sequence: 57-824 {underlined sequences correspond to start and stop codons) 



GAATTCCGCG 
TCAAGCTGAC 
GGGAGCTGGA 
ACCAGTTTGA 
TGTTGAGAAG 
GACTTGATCA 
AACTGGGTGA 
GAAATCCGGT 
TCAGAGTACT 
TCAAGGGCAA 
ATCCAGGTGC 
AAGCAATCAA 
GGTTGCTGCA 
GTGAAGAAGA 
ATAATAATAG 
TGTGTTAGCA 
TTTGTAATAT 
CACTCTCTAT 



11 

I 

GGAGGCCACG 
GGCGGAGCTG 
CCTCCGGGGG 
TGCTATTGAT 
ACTGAAAACA 
GGCTCTGCCC 
TCTGGACCCT 
AACCAATAAG 
GGATTTCCAG 
ACGGGGTGCA 
TGGTTTGCCA 
GAATGCCATA 
GTCTGGTCAG 
GATGGAAGAA 
GCCCTCTTGG 
AAGTGGAATC 
AAGTTTTGAA 
GCTAAAAAAA 



21 
I 

GGCTTTCCAC 
ATCGAGCAGG 
TATAAAATTC 
TTTTCTGACA 
TTGTTAGTGA 
TGTCTGACAG 
CTGGCATCTC 
AAGCATTACA 
AAAGTGAAAC 
CAGCTTGCAA 
ACTGACAAAA 
GCAAATGCTT 
ATCCCTGGCA 
GACACAGTCA 
AACAAGTCTT 
TATCAGCATT 
ATCTAAATGT 
AAAAAAAGGA 



31 
I 

AGCGCGGGGG 
CGGCGCAGTA 
CCGTCATTGA 
ATGAGATCAG 
ACAACAACAG 
AACTCATTCT 
TCAAATCGCT 
GATTGTATGT 
TAAAAGAGCG 
AGGATATTGC 
AGAGAGGTGG 
CAACTCTGGC 
GAGAACGCAG 
CAAACGGGTC 
GCTTTTCGAA 
GTTGAAATGC 
CAATTTTCTA 
ATTC 



41 


51 




I 

AACGGGAGGC 


1 

TGCAGGATGG 


60 


CACCAACGCG 


GTGCGCGACC 


120 


AAATCTAGGT 


GCTACGTTAG 


180 


GAAACTGGAT 


GGTTTTCCTT 


240 


AATATGCCGT 


ATAGGTGAGG 


300 


CACCAATAAT 


AGTCTCGTGG 


360 


GACTTACCTA 


AGTATCCTAA 


420 


GATTTATAAA 


GTTCCGCAAG 


480 


TCAGGAAGCA 


GAGAAAATGT 


540 


CAGGAGAAGC 


AAAACTTTTA 


600 


GCCATCTCCA 


GGGGATGTAG 


660 


TGAAGTGGAG 


AGGCTGAAGG 


720 


ATCAGGGCCC 


ACTGATGATG 


780 


CTGAGCAGTG 


AGGCAGATGT 


840 


CATGGTATAA 


TAGCCTTGTT 


900 


TTAAGACTGC 


TGCTGATAAT 


960 


CAAATTATAA 


AAATAAACTC 


1020 



Seq ID NO: 203 Protein sequence t 
Protein Accession #: NP_ 003081.1 



11 



21 



31 41 51 

III 
MVKLTAELIE QAAQYTNAVR DRELDLRGYK IPVIENLGAT LDQFDAIDFS DNEIRKIiDGF 
PLLRRIiKTLL VNNNRICRIG EGLDQAIiPCL TELILTNNSI* VELGDLDPIiA SLKSLTYLSI 
LRNPVTNKKH YRLYVIYKVP QVRVLDFQKV KLKERQEAEK MFKGKRGAQL AKDIARRSKT 
FNPGAGLPTD KKRGGPSPGD VEAIKNAIAN ASTLAEVERL KGLLQSGQIP GRERRSGPTD 
DGEEEMEEDT VTNGS 



60 
120 
180 
240 



70 



75 



Seq ID NO: 204 DNA sequence 

Nucleic Acid Accession #: NM_J>17643.1 

Coding sequence: 169-1401 (underlined sequences correspond to start and stop codons) 



11 



21 



31 
i 



41 



51 



AATAGCAATA GCTTTATAGC AGCTCCGGTT ACCTGTTTTA AACATGGAAG GAGAGTCGCT 
CCCAGATAGC CCTCACGAGT GGCCCTGGAG CAGGGAGTGG TGGAGCAGAT CTTCCTTGTT 
TGGGAGGAGC CTGAGGTGGA CCTCGCGTCC TGAGTCTGGA AGGCACCTAT_GGGGACCTGC 
TGGGGTGATA TCTCAGAAAA TGTGAGAGTA GAAGTTCCCA ATACAGACTG CAGCCTACCT 



60 
120 
180 
240 
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ACCAAAGTCT TCTGGATTGC TGGAATTGTA 
TATGAAGGAT TTGAAAATGA CTCTGGTCTG 
ATCCATCCAG TTGGTTGGTG TGCAGCCAGC 
CAGCATAAAT ATACAAACTG GAAAGCTTTT 
CTGCCTCCTG ATTTCTCCCA AAAGGTTTCA 
ATGAGAGTAG AAGTGGTTGA CAAGAGGCAT 
AGTGTAATTG GAGGAAGATT AAGACTAGTG 
TTCTGGTGCC ATATGCACAG CCCATTAATA 
CATCGATTCA AAAGATCTGA TATTACAAAG 
CATTTATTTG CTAAGGTAAA AGAAGTAGAC 
AAATTGGAAG CTATAGACCC ATTAAATCTT 
GTGCTAGCTG ACGGATTCCT GATGATTGGG 
GACTGGTTCT GTTACCATGC AACCTCTCCT 
AACATGATTG AACTTACTCC ACCCAGAGGT 
TACCTCAGGG AAACTGGCTC CATTGCAGCA 
AATCACGGAT TTCGTGTAGG AATGAAATTA 
ATATGTGTAG CCACAGTAAC TCGAATTATT 
TGGGAAGAAG AGTATGATCA GTGGGTAGAC 
TGGTGTCAGT TAACTGGATA TCAACTACAG . 
AGAAAAGGTG TCCTTTTG TA AA AATCAGCA 
TCTTATGAGC TCACAGGACA AGAATATACC 
AAGACTCAAC AACAATATCA CAGAATCAGA 
AGTCAATTAC ATAATGACTA TAGAAACACA 
TTTAGTGAGT TAAAAATTAC ATACTAAAAG 
AAATAGTGGA AAATGTCTCA TGTTGAGGCT 
TCAGAGCATC ATGTACTTAA GTATAATGGT 
CTAGACAACT GTATCGTCTA AATTGTAAAC 
TTTTCAGCAT CAAGAGAAAA CCAATCAGCT 
CAGCAATACA AAGGACATAA GAAAAGTGGG 
ATTGTTAACT AATTGGAGTC ACAGTATTCT 
TGATGATTGT GCATTATGTA TTATGCTTAA 
TGCAATAATG AGAAACACTG ATATTTTACT 
AGTATACGTG GTAAAGAATA GAGTCTGTGA 
TGAAAGTTAA TCCTTTTTAA AAACTTTATT 
GGCTCACGCC TGTAATCCCA GCACTTTAGG 
AGATCGAGAC CATCCTGGCT AACACGGTGA 
CTGCCGGGCG TGGTGGCACA CGCCTGAAGT 
ATCACTTGAA CCCAGGAGGC AGAGGTTGCA 
CTGGGCAACA CAGCAAGACT CTGTCTCAAA 



AAATTAGCAG GTTACAATGC CCTTTTAAGA 300 

GACTTCTGGT GCAATATATG TGGTTCTGAT 360 

GGAAAACCTC TTGTTCCTCC TAGAACTATT 420 

CTAGTGAAAC GACTTACTGG TGCCAAAACA 480 

GAGAGTATGC AGTATCCTTT CAAACCTTGC 54 0 

TTGTGTCGAA CACGAGTAGC AGTGGTGGAA 600 

TATGAAGAAA GCGAAGATAG AACAGATGAC 660 

CATCATATTG GTTGGTCTCG AAGCATAGGT 720 

AAACAGGATG GACATTTTGA TACACCACCA 780 

CAGAGTGGGG AATGGTTCAA GGAAGGAATG 840 

TCTACAATAT GTGTCGCAAC CATTAGAAAG 900 

ATCGATGGCT CAGAAGCAGC AGACGGATCT 960 

TCTATTTTCC CTGTCGGTTT CTGTGAAATT 1020 

TACACAAAAC TTCCTTTTAA ATGGTTTGAC 1080 

CCAGTAAAAC TATTTAATAA GGATGTTCCA 1140 

GAAGCAGTAG ATCTCATGGA GCCACGTTTA 1200 

CATCGTCTCT TGAGGATACA TTTTGATGGA 1260 

TGTGAGTCAC CTGACCTCTA TCCTGTAGGG 1320 

CCTCCAGCAT CACAGTGTAA GTTGGTATAC 1380 

ATTCTCCAGA GGACTATCTC ACATAAGTCA 1440 

TATGTCTGAT TGGTTGCCAG GTAAGACATT 1500 

CCATGTGTCC CATGGCAATG TGAATCCAAT 1560 

ACAGTCACCA AATTAAACTA GACTTACTAT 1620 

TTTATTGGTA GGTAATAAAT GCTTTTGAGT 1680 

ATGGTTTTGT AGGAACAAGT ACCCTTATTT 1740 

CTTGGTAAAG ATAGTTCATA TAAGTTGTAT 1800 

AATTATCTAG TACCAATTTT CCCTTTTTAT 1860 

TCATCAAAAC AGAAGAAAAA GGCTAAGTCC 1920 

TCACCACGTG GTGTTCACAT ACATTTTCTA 1980 

TGGACAGAAA ATGATATATC TTGTGAGAAC 2040 

AGGTGCAGTA TGCCATAAAA GGCAAACCCT 2100 

AACAGGAGAA ATGATTACCA CAGTATTTAA 2160 

ATGATTCTTG AAATAATATG TAAAACCTAC 2220 

TAAAAAGAAA AATTAGCAGC CAGGTGCAGT 2280 

AGGCCGAGGC TGGCAGATCA CAAGGTCAGG 2340 

AACCCTGTCT CCACCAAAAA TACAAAAAAT 2400 

CCCAGCTACT CAGGAGGCTG AGGCAAGAGA 2460 

GTGGGCCAAG ATCACGCCAC TACATTCCAG 2520 
AAAAAAAAAA AAAA 



Seq ID NO: 205 Protein sequence: 
Protein Accession #: NP_060113.1 

1 11 ' 21 31 41 51 

I I I I I I 

MGTCWGDISE NVRVEVPNTD CSLPTKVFWI AGIVKLAGYN ALLRYEGFEN DSGLDFWCNI 60 
CGSDIHPVGW CAASGKPLVP PRTIQHKYTN WKAFLVKRLT GAKTLPPDFS QKVSESMQYP 120 
FKPCMRVEW DKRHLCRTRV AWESVIGGR LRLVYEESED RTDDFWCHMH SPLIHHIGWS 180 
RSIGHRFKRS DITKKQDGHF DTPPHLFAKV KEVDQSGEWF KEGMKLEAID PLNLSTICVA 240 
TIRKVLADGF LMIGIDGSEA ADGSDWFCYH ATSPSIFPVG FCEINMIELT PPRGYTKLPF 300 
KWFDYLRETG SIAAPVKLFN KDVPNHGFRV GMKLEAVDLM EPRLICVATV TRIIHRLLRI 360 
HFDGWEEEYD QWVDCESPDL YPVGWCQLTG YQLQPPASQC KLVYRKGVLL 



Seq ID NO: 206 DNA seoruence 
Nucleic Acid Accession #: NM_012334 

Coding sequence: 223-6399 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAGACAAAGG CTGCCGTCGG GACGGGCGAG TTAGGGACTT GGGTTTGGGC GAACAAAAGG 60 

TGAGAAGGAC AAGAAGGGAC CGGGCGATGG CAGCAGGGGA GCCCCGCGGG CGCGCGTCCT 120 

CGGGAGTGGC GCCGTGACAC GCATGGTTTC CCCGGACCCG CGGCGGCGCT GACTTCCGCG 180 

AGTCGGAGCG GCACTCGGCG AGTCCGGGAC TGCGCTGGAA CAATGGATAA CTTCTTCACC 240 

GAGGGAACAC GGGTCTGGCT GAGAGAAAAT GGCCAGCATT TTCCAAGTAC TGTAAATTCC 300 

TGTGCAGAAG GCATCGTCGT CTTCCGGACA GACTATGGTC AGGTATTCAC TTACAAGCAG 360 

AGCACAATTA CCCACCAGAA GGTGACTGCT ATGCACCCCA CGAACGAGGA GGGCGTGGAT 420 

GACATGGCGT CCTTGACAGA GCTCCATGGC GGCTCCATCA TGTATAACTT ATTCCAGCGG 480 

TATAAGAGAA ATCAAATATA TACCTACATC GGCTCCATCC TGGCCTCCGT GAACCCCTAC 540 

CAGCCCATCG CCGGGCTGTA CGAGCCTGCC ACCATGGAGC AGTACAGCCG GCGCCACCTG 600 

GGCGAGCTGC CCCCGCACAT CTTCGCCATC GCCAACGAGT GCTACCGCTG CCTGTGGAAG 660 

CGCTACGACA ACCAGTGCAT CCTCATCAGT GGTGAAAGTG GGGCAGGTAA AACCGAAAGC 720 

ACTAAATTGA TCCTCAAGTT TCTGTCAGTC ATCAGTCAAC AGTCTTTGGA ATTGTCCTTA 780 

AAGGAGAAGA CATCCTGTGT TGAACGAGCT ATTCTTGAAA GCAGCCCCAT CATGGAAGCT 840 

TTCGGCAATG CGAAGACCGT GTACAACAAC AACTCTAGTC GCTTTGGGAA GTTTGTTCAG 900 
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CTGAACATCT GTCAGAAAGG AAATATTCAG 
AAAAACCGAG TAGTAAGGCA AAATCCCGGG 
CTGGCAGGGC TGGAACATGA AGAAAGAGAA 
CACTACTTGA ATCAGTCTGG ATGTGTAGAA 
5 AGGGAAGTTA TTACGGCAAT GGACGTGATG 
TCGAGGCTGC TTGCTGGTAT ACTGCATCTT 
GCACAGGTTT CCTTCAAAAC AGCTTTGGGC 
ACACAGCTCA CAGATGCTTT GACCCAGAGA 
ACGCCTCTCA ATGTTCAACA GGCAGTAGAC 

10 GCGTGCTGCT TTGAGTGGGT AATCAAGAAG 
TTCAAGTCTA TTGGCATCCT CGACATCTTT 
GAACAGTTCA ATATAAACTA TGCAAACGAG 
TTTTCTTTAG AACAACTAGA ATATAGCCGG 
ATAGACAATG GAGAATGCCT GGACTTGATT 

15 AATGAAGAAA GCCATTTTCC TCAAGCCACA 
CAGCATGCGA ATAACCACTT TTATGTGAAG 
AAGCACTATG CTGGAGAGGT GCAATATGAT 
ACATTTCGAG ATGACCTTCT CAATTTGCTA 
CTTTTTGAAC ATGTTTCAAG CCGCAACAAC 

20 CGGCGGCCTA CAGTCAGCTC ACAGTTCAAG 
AGCTCCTCTA ATCCTTTCTT TGTTCGCTGT 
CAGTTTGACC AGGCGGTTGT GCTGAACCAG 
AGAATCCGCA AAGCTGGGTA TGCGGTCCGA 
AAAGTGCTGA TGAGGAATCT GGCTCTGCCT 

25 CTGCAGCTCT ATGATGCCTC CAACAGCGAG 
CGAGAATCCT TGGAACAGAA ACTGGAGAAG 
ATGGTGATTC GGGCCCATGT CTTGGGCTTC 
TATTGTGTGG TGATAATACA GAAGAATTAC 
CACCTGAAAA AGGCAGCCAT AGTTTTCCAG 

30 GTTTACAGAC AATTGCTGGC AGAGAAAAGG 
GAAGAAAAGA AGAAACGGGA GGAAGAAGAA 
GAGCTCCGCG CCCAGCAGGA AGAAGAAACG 
AAGAGCCAGA AGGAAGCTGA ACTGACCCGT 
GTGGAAGAGA TCCTCCGTCT GGAGAAAGAA 

35 CAGGAGCTGT CGCTGACCGA GGCTTCCCTG 
CTCCGCAGGC TGGAGGAGGA AGCGTGCAGG 
TTCGACGAGA TCGACGAGTG TGTCCGGAAT 
TTTTCCAGCG AGCTGGCTGA GAGCGCATGC 
CCCTACCCAG AGGAGGAGGT CGATGAGGGC 

40 TCCCCCAACC CCAGCGAGCA CGGCCACTCA 
GATGACTCTT CAGAGGAGGA CCCATACATG 
GCGGACAGCA CGGTGCTGCT CGCCCCATCA 
TCCAGCGGCG AGTCCACCTA CTGCATGCCC 
GGCGACTACG ACTACGACCA GGATGACTAT 

45 GTGACCTTCT CCAACTCCTA CGGCAGCCAG 
ACCTACAACA GCTCGGGTGC CTACCGGTTC 
GATAGTGAAG AGGACTTTGA TTCCAGGTTT 
GACTCTGTGT ACAGCTGTGT CACTCTGCCG 
GGCCTGATGA ACTCTTGGAA ACGCCGCTGG 

50 . TTCCGCTCCA AGCAGGAGGC CCTCAAGCAA 
TCCACGCTGT CCAGGAGAAA TTGGAAGAAG 
ATGTACTTTG AAAACGACAG CGAGGAGAAG 
AAAGAGATCA TAGATAACAC CACCAAGGAG 
ACTTTCCACC TGATTGCAGA GTCCCCAGAA 

55 CAGGTCCACG CGTCCACGGA CCAGGAGATC 
CAGAATGCTG TGGGCACCTT GGATGTGGGG 
. CCTGATAGAC CCAACTCGTT TGTGATCATC 
GACACGCCGG AGGAGATGCA CCACTGGATA 
AGAGTGGAGG GCCAGGAATT CATCGTGAGA 

60 CCGAAGATGT CTTCACTGAA ACTGAAGAAA 
GATTACTACA AGAGTTCAGA GAAGAACGCG 
CTCTGCTCTG TCGTCCCCCC AGATGAGAAG 
ACCGTGTACG GGCGCAAGCA CTGTTACCGG 
* CGGTGGTCCA GTGCCATTCA AAACGTGACT 

65 CAGCAGCTGA TTCAAGATAT CAAGGAGAAC 
TACAAGCGGA ACCCGATCCT TCGATACACC 
CTTCCGTATG GGGACATAAA TCTCAACTTG 
GATGAGGCCA TCAAGATATT CAATTCCCTG 
CCAATAATCC AGGGCATCCT ACAGACAGGG 

70 TACTGCCAGC TTATCAAACA GACCAACAAA 
TACAGCTGGC AGATCCTGAC ATGCCTGAGC 
AAGTATCTCA AGTTCCATCT GAAAAGGATA 
AAATACGCTC TCTTCACTTA CGAATCTCTT 
TCCCGAGATG AAATAGAAGC TCTGATCCAC 

75 CATGGCGGCG GCTCCTGCAA GATCACCATC 
GAGAAGCTGA TCCGAGGCCT GGCCATGGAG 



GGCGGGAGAA TTGTAGATTA TTTATTAGAA 960 

GAAAGGAATT ATCACATATT TTATGCACTG 1020 

GAATTTTATT TATCTACGCC AGAAAACTAC 1080 

GACAAGACAA TCAGTGACCA GGAATCCTTT 1140 

CAGTTCAGCA AGGAGGAAGT TCGGGAAGTG 1200 

GGGAACATAG AATTTATCAC TGCTGGTGGG 1260 

AGATCTGCGG AGTTACTTGG GCTGGACCCA 1320 

TCAATGTTCC TCAGGGGAGA AGAGATCCTC 1380 

AGCAGGGACT CCCTGGCCAT GGCTCTGTAT 1440 

ATCAACAGCA GGATCAAAGG CAATGAGGAC 1500 

GGATTTGAAA ACTTTGAGGT TAATCACTTT 1560 

AAACTTCAGG AGTACTTCAA CAAGCATATT 1620 

GAAGGATTAG TGTGGGAAGA TATTGACTGG 1680 

GAGAAGAAAC TTGGCCTCCT AGCCCTTATC 1740 

GACAGCACCT TATTGGAGAA GCTACACAGT 1800 

CCCAGAGTTG CAGTTAACAA TTTTGGAGTG I860 

GTCCGAGGTA TCTTGGAGAA GAACAGAGAT 1920 

AGAGAAAGCC GATTTGACTT TATCTACGAT 1980 

CAGGATACCT TGAAATGTGG AAGCAAACAT 2040 

GACTCACTGC ATTCCTTAAT GGCAACGCTA 2100 

ATCAAGCCAA ACATGCAGAA GATGCCAGAC 2160 

CTGCGGTACT CAGGGATGCT GGAGACTGTG 2220 

AGACCCTTTC AGGACTTTTA CAAAAGGTAT 2280 

GAGGACGTCC GAGGGAAGTG CACGAGCCTG 2340 

TGGCAGCTGG GGAAGACCAA GGTCTTTCTT 2400 

CGGAGGGAAG AGGAAGTGAG CCACGCGGCC 2460 

TTAGCACGAA AACAATACAG AAAGGTCCTT 2520 

AGAGCATTCC TTCTGAGGAG GAGATTTTTG 2580 

AAGCAACTCA GAGGTCAGAT TGCTCGGAGA 2640 

GAGCAAGAAG AAAAGAAGAA ACAGGAAGAG 2700 

AGAGAAAGAG AGAGAGAGCG AAGAGAAGCC 2760 

AGGAAGCAGC AAGAACTCGA AGCCTTGCAG 2820 

GAACTGGAGA AACAGAAGGA AAATAAGCAG 2880 

ATCGAGGACC TGCAGCGCAT GAAGGAGCAG 2940 

CAGAAGCTGC AGGAGCGGCG GGACCAGGAG 3000 

GCGGCCCAGG AGTTCCTCGA GTCCCTCAAT 3060 

ATCGAGCGGT CCCTGTCGGT GGGAAGCGAA 3120 

GAGGAGAAGC CCAACTTCAA CTTCAGCCAG 3180 

TTCGAAGCCG ACGACGACGC CTTCAAGGAC 3240 

GACCAGCGAA CAAGTGGCAT CCGGACCAGC 3300 

AACGACACGG TGGTGCCCAC CAGCCCCAGT 3360 

GTGCAGGACT CCGGGAGCCT ACACAACTCC 3420 

CAGAACGCTG GGGACTTGCC CTCCCCAGAC 3480 

GAGGACGGTG CCATCACTTC CGGCAGCAGC 3540 

TGGTCCCCCG ACTACCGCTG CTCTGTGGGG 3600 

AGCTCTGAGG GGGCGCAGTC CTCGTTTGAA 3660 

GATACAGATG ATGAGCTTTC ATACCGGCGT 3720 

TATTTCCACA GCTTTCTGTA - CATGAAAGGT 3780 

TGCGTCCTCA AGGATGAAAC CTTCTTGTGG 3840 

GGCTGGCTCC ACAAAAAAGG GGGGGGCTCC 3 900 

CGCTGGTTTG TCCTCCGCCA GTCCAAGCTG 3960 

CTCAAGGGCA CCGTAGAAGT GCGAACGGCA 4020 

AATGGGATCG ACATCATTAT GGCCGATAGG 4 080 

GATGCCAGCC AGTGGTTCAG CGTGCTGAGT 4140 

CAGGAGATGC ATGATGAGCA GGCAAACCCA 4200 

CTGATTGATT CTGTGTGTGC CTCTGACAGC 4260 

ACGGCCAACC GGGTGCTGCA CTGCAACGCC 4320 

ACCCTGCTGC AGAGGTCCAA AGGGGACACC 4380 

GGATGGTTGC ACAAAGAGGT GAAGAACAGT 4440 

CGGTGGTTTG TACTCACCCA CAATTCCCTG 4500 

CTCAAACTGG GGACCCTGGT CCTCAACAGC 4560 

ATATTCAAAG AGACAGGCTA CTGGAACGTC 4620 

CTCTACACCA AGCTGCTCAA CGAGGCCACC 4680 

GACACCAAGG CCCCGATCGA CACCCCCACC 4740 

TGCCTGAACT CGGATGTGGT GGAACAGATT 4800 

CATCACCCCT TGCACTCCCC GCTCCTGCCC 4860 

CTCAAAGACA AAGGCTATAC CACCCTTCAG 4920 

CAGCAACTGG AGTCCATGTC TGACCCAATT 4980 

CATGACCTGC GACCTCTGCG GGACGAGCTG 5040 

GTGCCCCACC CCGGCAGTGT GGGCAACCTG 5100 

TGCACCTTCC TGCCGAGTCG AGGGATTCTC 5160 

CGGGAACAGT TTCCAGGAAC CGAGATGGAA 5220 

AAGAAAACCA AATGCCGAGA GTTTGTGCCT 5280 

AGGCAGGAAA TGACATCCAC GGTCTATTGC 5340 

AACTCCCACA CCACTGCTGG GGAGGTGGTG 5400 

GACAGCAGGA ACATGTTTGC TTTGTTTGAA 5460 
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TACAACGGCC ACGTCGACAA AGCCATTGAA AGTCGAACCG TCGTAGCTGA TGTCTTAGCC 5520 

AAGTTTGAAA AGCTGGCTGC CACATCCGAG GTTGGGGACC TGCCATGGAA ATTCTACTTC 5580 

AAACTTTACT GCTTCCTGGA CACAGACAAC GTGCCAAAAG ACAGTGTGGA GTTTGCATTT 5640 

ATGTTTGAAC AGGCCCACGA AGCGGTTATC CATGGCCACC ATCCAGCCCC GGAAGAAAAC 5700 

5 CTCCAGGTTC TTGCTGCCCT GCGACTCCAG TATCTGCAGG GGGATTATAC TCTGCACGCT 5760 

GCCATCCCAC CTCTCGAAGA GGTTTATTCC CTGCAGAGAC TCAAGGCCCG CATCAGCCAG 5820 

TCAACCAAAA CCTTCACCCC TTGTGAACGG CTGGAGAAGA GGCGGACGAG CTTCCTAGAG 5880 

GGGACCCTGA GGCGGAGCTT CCGGACAGGA TCCGTGGTCC GGCAGAAGGT CGAGGAGGAG 5940 

CAGATGCTGG ACATGTGGAT TAAGGAAGAA GTCTCCTCTG CTCGAGCCAG TATCATTGAC 6000 

10 AAGTGGAGGA AATTTCAGGG AATGAACCAG GAACAGGCCA TGGCCAAGTA CATGGCCTTG 6060 

ATCAAGGAGT GGCCTGGCTA TGGCTCGACG CTGTTTGATG TGGAGTGCAA GGAAGGTGGC 6120 

TTCCCTCAGG AACTCTGGTT GGGTGTCAGC GCGGACGCCG TCTCCGTCTA CAAGCGTGGA 6180 

GAGGGAAGAC CACTGGAAGT CTTCCAGTAT GAACACATCC TCTCTTTTGG GGCACCCCTG 6240 

GCGAATACGT ATAAGATCGT GGTCGATGAG AGGGAGCTGC TCTTTGAAAC CAGTGAGGTG 6300 

15 GTGGATGTGG CCAAGCTCAT GAAAGCCTAC ATCAGCATGA TCGTGAAGAA GCGCTACAGC 6360 

ACGACACGCT CCGCCAGCAG CCAGGGCAGC TCCAGGTGAA GGCGGGACAG AGCCCACCTG 6420 

TCTTTGCTAC CTGAACGCAC CACCCTCTGG CCTAGGCTGG CTCCAGTGTG CCATGCCCAG 6480 

CCAAAACAAA CACAGAGCTG CCCAGGCTTT CTGGAAGCTT CTGGTCTGAG GGAGGTGTCT 6540 

CCGAGGATCC TTTTGCCTGC CGCCTTCATT GATCCTGTAT TAAGCTGTCA ACTTTAACAG 6600 

20 TCTGCACAGT TTCCAAAGCT TTACTACTCT TAGAGGACAC ATGCCTTAAA AAAGGAGGGG 6660 

AGGAACCACG CTGCCACCAA AGCAGCCGGA AGTGCCTTAA CTTGTGGAAC CAACACTAAT 6720 

CGACCGTAAC TGTGCTACTG AAGGGAACTG CCTTTCCCCC TTCTGGGGGA GACTTAACAG 6780 

AGCGTGGAAG GGGGGCATTC TCTGTCAATG ATGCACTAAC CTCCCAACCT GATTTCCCCG 6840 

AATCTGAGGG AAGGTGAGGG AGTGGGAAGG GGGATGGAGA GCTCGAGGGG ACAGTGTGTT 6900 

25 TGAGCTGGAG TGCTGCGGGC AGCCTTTCTC ATGGAATGAC ATGAATCAAC TTTTTTCTTT 6960 

GTTTCATCTT TTAAGTGTAC GTGCTTGCCT GTTCGTGCAT GTGTTCATAA ACTCAACACT 7020 

TTAATCATGG TTTCATGAGC ATTAAAAAGC AAAGGGAAAA AGGATGTGTA ATGGTGTACA 7080 

CAGTCTGTAT ATTTTAATAA TGCAGAGCTA TAGTCTCAAT TGTTACTTTA TAAGGTGGTT 7140 

TTATTAACAA ACCCAAATCC TGGATTTTCC TGTCTTTGCT GTATTTTGAA AAACACGTGT 7200 

30 TGACTCCATT GTTTTACATG TAGCAAAGTC TGCCATCTGT GTCTGCTGTA TTATAAACAG 7260 

ATAAGCAGCC TACAAGATAA CTGTATTTAT AAACCACTCT TCAACAGCTG GCTCCAGTGC 7320 

TGGTTTTAGA ACAAGAATGA AGTCATTTTG GAGTCTTTCA TGTCTAAAAG ATTTAAGTTA 7380 

AAAACAAAGT GTTACTTGGA AGGTTAGCTT CTATCATTCT GGATAGATTA CAGATATAAT 7440 

AACCATGTTG ACTATGGGGG AGAGACGCTG CATTCCAGAA ACGTCTTAAC ACTTGAGTGA 7500 

35 ATCTTCAAAG GACCCTGACA TTAAATGCTG AGGCTTTAAT ACACACATAT TTTATCCCAA 7560 

GTTTATAATG GTGGTCTGAA CAAGGCACCT GTAAATAAAT CAGCATTTAT GACCAGAAGA 7620 

AAAATAATCT GGTCTTGGAC TTTTTATTTT TATATGGAAA AGTTTTAAGG ACTTGGGCCA 7680 

ACTAAGTCTA CCCACACGAA AAAAGAAATT TGCCTTGTCC CTTTGTGTAC AACCATGCAA 7740 
AACTGTTTGT TGGCTCACAG AAGTTCTGAC AATAAAAGAT ACTAGCT 

40 

Seq ID NO: 207 Protein sequence: 
Protein Accession #: NP_036466 

1 11 21 31 41 51 

45 | | | I II 

MDNFPTEGTR VWLRENGQHF PSTVNSCAEG IWFRTDYGQ VPTYKQSTIT HQKVTAMHPT 60 

NEEGVDDMAS LTELHGGSIM YNLFQRYKRN QIYTYIGSIL ASVNPYQPIA GLYEPATMEQ 120 

YSRRHLGELP PHIFAIANEC YRCLWKRYDN QCILISGESG AGKTESTKLI LKFLSVISQQ 180 

SLELSLKEKT SCVERAILES SPIMEAFGNA KTVYNNNSSR FGKFVQLNIC QKGNIQGGRI 240 

50 VDYLLEKNRV VRQNPGERNY HIFYALLAGL EHEEREEFYL STPENYHYLN QSGCVEDKTI 300 

SDQESFREVI TAMDVMQFSK EEVREVSRLL AGILHLGNIE FITAGGAQVS FKTALGRSAE 360 

IiLGLDPTQLT DAliTQRSMFL RGEEILTPLN VQQAVDSRDS IAMALYACCF EWVIKKINSR 420 

1KGNEDFKSI GILDIFGFEN FEVNHFEQFN INYANEKLQE YFNKHIFSLE QLEYSREGLV 480 

WEDIDWIDNG ECLDLIEKKL GLLALINEES HFPQATDSTL LEKLHSQHAN NHFYVKPRVA 540 

55 .VKNFGVKHYA GEVQYDVRGI LEKNRDTFRD DLLNLLRESR FDFIYDLFEH VSSRNNQDTIi 600 

KCGSKHRRPT VSSQFKDSLH SLMATLSSSN PFFVRCIKPN MQKMPDQFDQ AWLNQLRYS 660 

GMLETVRIRK AGYAVRRPFQ DFYKRYKVLM RNLALPEDVR GKCTSLLQLY DASNSEWQLG 720 

KTKVFLRESL EQKLEKRREE EVSHAAMVIR AHVLGFLARK QYRKVLYCW IIQKNYRAFL 780 

LRRRFLHLKK AAIVFQKQLR GQIARRVYRQ LLAEKREQEE KKKQEEEEKK KREEEERERE 840 

60 RERREAELRA QQEEETRKQQ ELEALQKSQK EAELTRELEK QKENKQVEEI LRIiEKEIEDL 900 

QRMKEQQELS LTEASLQKLQ ERRDQELRRL EEEACRAAQE FLESLNFDEI DECVRNIERS 960 

LSVGSEFSSE LAESACEEKP NFNFSQPYPE EEVDEGFEAD DDAFKDSPNP SEHGHSDQRT 1020 

SGIRTSDDSS EEDPYMNDTV VPTSPSADST VLLAPSVQDS GSLHNSSSGE STYCMPQNAG 1080 

DLPSPDGDYD YDQDDYEDGA ITSGSSVTFS NSYGSQWSPD YRCSVGTYNS SGAYRFSSEG 1140 

65 AQSSFEDSEE DFDSRFDTDD ELSYRRDSVY SCVTLPYFHS FLYMKGGLMN SWKRRWCVLK 1200 

DETFLWFRSK QEALKQGWIiH KKGGGSSTLS RRNWKKRWFV IiRQSKLMYFE NDSEEKLKGT .1260 

VEVRTAKEII DNTTKENGID IIMADRTFHL IAESPEDASQ WFSVLSQVHA STDQEIQEMH 1320 

DEQANPQNAV GTLDVGIjIDS VCASDSPDRP NSFVIITANR VLHCNADTPE EMHHWITLLQ 1380 

RSKGDTRVEG QEFIVRGWLH KEVKNSPKMS SLKLKKRWFV LTHNSLDYYK SSEKNALiKLG 1440 

70 TLVLNSLCSV VPPDEKIFKE TGYWNVTVYG RKHCYRLYTK LLNEATRWSS AIQNVTDTKA 1500 

PIDTPTQQIjI QDIKENCLNS DWEQIYKRN PILRYTHHPL HSPIiLPLPYG DINLNLLKDK 1560 

GYTTLQDEAI KIFNSLQQLE SMSDPIPIIQ GILQTGHDLR PLRDELYCQL IKQTNKVPHP 1620 

GSVGNLYSWQ ILTCLSCTFL PSRGILKYIiK FHLiKRIREQF PGTEMEKYAL FTYESLKKTK 1680 

CREFVPSRDE IEALIHRQEM TSTVYCHGGG SCKITINSHT TAGEWEKLI RGLAMEDSRN 1740 

75 MFALFEYNGH VDKAIESRTV VADVLAKFEK LAATSEVGDL PWKFYFKLYC FLDTDNVPKD 1800 

SVEFAFMFEQ AHEAVIHGHH PAPEENLQVL AALRLQYLQG DYTLHAAIPP LEEVYSLQRL 1860 
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KARISQSTKT FTPCERLEKR RTSFLEGTLR RSFRTGSWR QKVEEEQMLD MWIKEEVSSA 1920 

RASIIDKWRK FQGMNQEQAM AKYMALIKEW PGYGSTLFDV ECKEGGFPQE LWLGVSADAV 1980 

SVYKRGEGRP LEVFQYEHIIi SFGAPLANTY KIWDERELL FETSEWDVA KLMKAYISMI 2040 
VKKRYSTTRS ASSQGSSR 

Seq ID NO: 208 DNA sequence. 

Nucleic Acid Accession #: XM_059761.1 

Coding sequence: 124-525 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CGAAGATCTA TCCAAAATCA AGAAGCCTTT GATTTAGATG TTGCTGTAAA AGAAAATAAA 60 

GATGATCTCA ATCATGTGGA TTTGAATGTG TGTACAAGCT TTTCGGGCCC GGGTAGGAGT 120 

GGCATGGCTC TTATGGAAGT TAACCTATTA AGTGGCTTTA TGGTGCCTTC AGAAGCAATT 180 

TCTCTGAGCG AGACAGTGAA GAAAGTGGAA TATGATCATG GAAAACTCAA CCTCTATTTA 240 

GATTCTGTAA ATGAAACCCA GTTTTGTGTT AATATTCCTG CTGTGAGAAA CTTTAAAGTT 300 

TCAAATACCC AAGATGCTTC AGTGTCCATA GTGGATTACT ATGAGCCAAG GAGACAGGCG 360 

GTGAGAAGTT ACAACTCTGA AGTGAAGCTG TCCTCCTGTG ACCTTTGCAG TGATGTCCAG 420 

GGCTGCCGTC CTTGTGAGGA TGGAGCTTCA GGCTCCCATC ATCACTCTTC AGTCATTTTT 480 

ATTTTCTGTT TCAAGCTTCT GTACTTTATG GAACTTTGGC TGTGATTTAT TTTTAAAGGA 540 

CTCTGTGTAA CACTAACATT TCCAGTAGTC ACATGTGATT GTTTTGTTTT CGTAGAAGAA 600 

TACTGCTTCT ATTTTGAAAA AAGAGTTTTT TTTCTTTCTA TGGGGTTGCA GGGATGGTGT 660 

ACAACAGGTC CTAGCATGTA TAGCTGCATA GATTTCTTCA CCTGATCTTT GTGTGGAAGA 720 

TCAGAATGAA TGCAGTTGTG TGTCTATATT TTCCCCTCTC AAAATCTTTT AGAATTTTTT 780 
TGGAGGTGTT TGTTTTCTCC AGAATAAAGG TATTACTTTA G 

Seq ID NO: 209 Protein sequence: 
Protein Accession #: XP_059 761.1 

1 11 21 31 41 51 

III III 

MAIiMEVNLLS GFMVPSEAIS LSETVKKVEY DHGKLNLYLD SVNETQFCVN IPAVRNFKVS 60 
NTQDASVSIV DYYEPRRQAV RSYNSEVKLS SCDLCSDVQG CRPCEDGASG SHHHSSVIFI 120 
FCFKLLYFME LWL 



Seq ID NO: 210 DNA sequence 
Nucleic Acid Accession #: NM_015472 

Coding sequence: 258-1460 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GACACACTCC TCTACAACAC CAGAGACTCC CAAACACAAG GCCTTATATT GACTCATTTC 60 

AGCTCACATC CTGGCGACTC TCAAGAGAGA AACCTCAGAG TGACTAAAAT CTCCATAATG 120 

AGAAGACATG TACATTCAGT ATCTATTTTG GCATTTTCCC CAATACATCT CTGCTCATCT 180 

GACTCTTATC TTGGCATCTG CTTCCTGGTG GATCTGAACT GACCCATAAG CCACGCTTAC 240 

TGGTGATTTT CCAGAAGATG AATCCGGCCT CGGCGCCCCC TCCGCTCCCG CCGCCTGGGC 300 

AGCAAGTGAT CCACGTCACG CAGGACCTAG ACACAGACCT CGAAGCCCTC TTCAACTCTG 360 

TCATGAATCC GAAGCCTAGC TCGTGGCGGA AGAAGATCCT GCCGGAGTCT TTCTTTAAGG 420 

AGCCTGATTC GGGCTCGCAC TCGCGCCAGT CCAGCACCGA CTCGTCGGGC GGCCACCCGG 480 

GGCCTCGACT GGCTGGGGGT GCCCAGCATG TCCGCTCGCA CTCGTCGCCC GCGTCCCTGC 540 

AGCTGGGCAC CGGCGCGGGT GCTGCGGGTA GCCCCGCGCA GCAGCACGCG CACCTCCGCC 600 

AGCAGTCCTA CGACGTGACC GACGAGCTGC CACTGCCCCC GGGCTGGGAG ATGACCTTCA 660 

CGGCCACTGG CCAGAGGTAC TTCCTCAATC ACATAGAAAA AATCACCACA TGGCAAGACC 720 

CTAGGAAGGC GATGAATCAG CCTCTGAATC ATATGAACCT CCACCCTGCC GTCAGTTCCA 780 

CACCAGTGCC TCAGAGGTCC ATGGCAGTAT CCCAGCCAAA TCTCGTGATG AATCACCAAC 840 

ACCAGCAGCA GATGGCCCCC AGTACCCTGA GCCAGCAGAA CCACCCCACT CAGAACCCAC 900 

CCGCAGGGCT CATGAGTATG CCCAATGCGC TGACCACTCA GCAGCAGCAG CAGCAGAAAC 960 

TGCGGCTTCA GAGAATCCAG ATGGAGAGAG AAAGGATTCG AATGCGCCAA GAGGAGCTCA 1020 

TGAGGCAGGA AGCTGCCCTC TGTCGACAGC TCCCCATGGA AGCTGAGACT CTTGCCCCAG 1080 

TTCAGGCTGC TGTCAACCCA CCCACGATGA CCCCAGACAT GAGATCCATC ACTAATAATA 1140 

GCTCAGATCC TTTCCTCAAT GGAGGGCCAT ATCATTCGAG GGAGCAGAGC ACTGACAGTG 1200 

GCCTGGGGTT AGGGTGCTAC AGTGTCCCCA CAACTCCGGA GGACTTCCTC AGCAATGTGG 1260 

ATGAGATGGA TACAGGAGAA AACGCAGGAC AAACACCCAT GAACATCAAT CCCCAACAGA 1320 

CCCGTTTCCC TGATTTCCTT GACTGTCTTC CAGGAACAAA CGTTGACTTA GGAACTTTGG 1380 

AATCTGAAGA CCTGATCCCC CTCTTCAATG ATGTAGAGTC TGCTCTGAAC AAAAGTGAGC 1440 

CCTTTCTAAC CTGGCTGTAA TCACTACCAT TGTAACTTGG ATGTAGCCAT GACCTTACAT 1500 

TTCCTGGGCC TCTTGGAAAA AGTGATGGAG CAGAGCAAGT CTGCAGGTGC ACCACTTCCC 1560 

GCCTCCATGA CTCGTGCTCC CTCCTTTTTA TGTTGCCAGT TTAATCATTG CCTGGTTTTG 1620 
ATTGAGAGTA ACTTAAGTTA AACATAAATA AATATTCTAT TTTCATTTTC 

Seq ID NO: 211 Protein sequence: 
Protein Accession #: NP_056287.1 

1 11 21 31 41 51 
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MNPASAPPPL PPPGQQVIHV TQDLDTDLEA LFNSVMNPKP SSWRKKILPE SFFKEPDSGS 60 

HSRQSSTDSS GGHPGPRLAG GAQHVRSHSS PASLQLGTGA GAAGSPAQQH AHLRQQSYDV 120 

TDELPLPPGW EMTFTATGQR -YFLNHIEKIT TWQDPRKAMN QPLNHMNLHP AVSSTPVPQR 180 

SMAVSQPNLV MNHQHQQQMA PSTLSQQNHP TQNPPAGLMS MPNALTTQQQ QQQKLRLQRI 240 

QMERERIRMR QEELMRQEAA IjCRQLPMEAE TLAPVQAAVN PPTMTPDMRS ITNNSSDPFL 300 

NGGPYHSREQ STDSGLGLGC YSVPTTPEDF LSNVDEMDTG ENAGQTPMNI NPQQTRFPDF 360 

LDCLPGTNVD LGTLESEDLI PLFNDVESAL NKSEPFLTWL 



Seq ID NO: 212 DNA sequence 
Nucleic Acid Accession #: NM_018174 

Coding sequence: 176-2194 (underlined sequences correspond to start and stop codons) 



CATCTCCCCC AACCTGGGGG TCGTGTTCTT CAACGCCTGC GAGGCCGCGT CGCGGCTGGC 60 

GCGCGGCGAG GATGAGG CGG AGCTGGCGCT GAGCCTCCTG GCGCAGCTGG GCATCACGCC 120 

TCTGCCACTC AGCCGCGGCC CCGTGCCAGC CAAACCCACC GTGCTCTTCG AGAAGATGGG 180 

CGTGGGCCGG CTGGACATGT ATGTGCTGCA CCCGCCCTCC GCCGGCGCCG AGCGCACGCT 240 

GGCCTCTGTG TGCGCCCTGC TGGTGTGGCA CCCCGCCGGC CCCGGCGAGA AGGTGGTGCG 300 

CGTGCTGTTC CCCGGTTGCA CCCCGCCCGC CTGCCTCCTG GACGGCCTGG TCCGCCTGCA 360 

GCACTTGAGG TTCCTGCGAG AGCCCGTGGT GACGCCCCAG GACCTGGAGG GGCCGGGGCG 420 

AGCCGAGAGC AAAGAGAGCG TGGGCTCCCG GGACAGCTCG AAGAGAGAGG GCCTCCTGGC 480 

CACCCACCCT AGACCTGGCC AGGAGCGCCC TGGGGTGGCC CGCAAGGAGC CAGCACGGGC 540 

TGAGGCCCCA CGCAAGACTG AGAAAGAAGC CAAGACCCCC CGGGAGTTGA AGAAAGACCC 600 

CAAACCGAGT GTCTCCCGGA CCCAGCCGCG GGAGGTGCGC CGGGCAGCCT CTTCTGTGCC 660 

CAACCTCAAG AAGACGAATG CCCAGGCGGC ACCCAAGCCC CGCAAAGCGC CCAGCACGTC 720 

CCACTCTGGC TTCCCGCCGG TGGCAAATGG ACCCCGCAGC CCGCCCAGCC TCCGATGTGG 780 

AGAAGCCAGC CCCCCCAGTG CAGCCTGCGG CTCTCCGGCC TCCCAGCTGG TGGCCACGCC 840 

CAGCCTGGAG CTGGGGCCGA TCCCAGCCGG GGAGGAGAAG GCACTGGAGC TGCCTTTGGC 900 

CGCCAGCTCA ATCCCAAGGC CACGCACACC CTCCCCTGAG TCCCACCGGA GCCCCGCAGA 960 

GGGCAGCGAG CGGCTGTCGC TGAGCCCACT GCGGGGCGGG GAGGCCGGGC CAGACGCCTC 1020 

ACCCACAGTG ACCACACCCA CGGTGACCAC GCCCTCACTA CCCGCAGAGG TGGGCTCCCC 1080 

GCACTCGACC GAGGTGGACX3 AGTCCCTGTC GGTGTCCTTT GAGCAGGTGC TGCCGCCATC 1140 

CGCCCCCACC AGTGAGGCTG GGCTGAGCCT CCCGCTGCGT GGCCCCCGGG CGCGGCGCTC 1200 

GGCTTCCCCA CACGATGTGG ACCTGTGCCT GGTGTCACCC TGTGAATTTG AGCATCGCAA 1260 

GGCGGTGCCA ATGGCACCGG CACCTGCGTC CCCCGGCAGC TCGAATGACA GCAGTGCCCG 1320 

GTCAGAGGAA CGGGCAGGTG GGCTGGGGGC CGAGGAGACG CCACCCACAT CGGTCAGCGA 1380 

GTCCCTGCCC ACCCTGTCTG ACTCGGATCC CGTGCCCCTG GCCCCCGGTG CGGCAGACTC 1440 

AGACGAAGAC ACAGAGGGCT TTGGAGTCCC TCGCCACGAC CCTTTGCCTG ACCCCCTCAA 1500 

GGTCCCCCCA CCACTGCCTG ACCCATCCAG CATCTGCATG GTGGACCCCG AGATGCTGCC 1560 

CCCCAAGACA GCACGGCAAA CGGAGAACGT CAGCCGCACC CGGAAGCCCC TGGCCCGCCC 1620 

CAACTCACGC GCTGCCGCCC CCAAAGCCAC TCCAGTGGCT GCTGCCAAAA CCAAGGGGCT 1680 

TGCTGGTGGG GACCGTGCCA GCCGACCACT CAGTGCCCGG AGTGAGCCCA GTGAGAAGGG 1740 

AGGCCGGGCA CCCCTGTCCA GAAAGTCCTC AACCCCCAAG ACTGCCACTC GAGGCCCGTC 1800 

GGGGTCAGCC AGCAGCCGGC CCGGGGTGTC AGCCACCCCA CCCAAGTCCC CGGTCTACCT 1860 

GGACCTGGCC TACCTGCCCA GCGGGAGCAG CGCCCACCTG GTGGATGAGG AGTTCTTCCA 1920 

GCGCGTGCGC GCGCTCTGCT ACGTCATCAG TGGCCAGGAC CAGCGCAAGG AGGAAGGCAT 1980 

GCGGGCCGTC CTGGACGCGC TACTGGCCAG CAAGCAGCAT TGGGACCGTG ACCTGCAGGT 2040 

GACCCTGATC CCCACTTTCG ACTCGGTGGC CATGCATACG TGGTACGCAG AGACGCACGC 2100 

CCGGCACCAG GCGCTGGGCA TCACGGTGTT GGGCAGCAAC GGCATGGTGT CCATGCAGGA 2160 

TGACGCCTTC CCGGCCTGCA AGGTGGAGTT C TAG CCCCAT CGCCGACACG CCCCCCACTC 2220 
AGCCCAGCCC GCCTGTCCCT AGATTCAGCC ACATCAGAAA TAAACTGTGA CTACACTTG 

Seq ID NO: 213 Protein sequence: 
Protein Accession #: NP_060644.1 

MGVGRLDMYV LHPPSAGAER TIASVCALLV WHPAGPGEKV VRVLFPGCTP PACLU3GLVR 60 

LQHLRFLREP WTPQDLEGP GRAESKESVG SRDSSKREGL LATHPRPGQE RPGVARKEPA 120 

RAEAPRKTEK EAKTPRELKK DPKPSVSRTQ PREVRRAASS VPNLKKTNAQ AAPKPRKAPS 180 

TSHSGFPPVA NGPRSPPSLR CGEASPPSAA CGSPASQIjVA TPSLEIiGPIP AGEEKALELP 240 

LAASSIPRPR TPSPESHRSP AEGSERLSLS PLRGGEAGPD ASPTVTTPTV TTPSfcPAEVG 300 

SPHSTEVDES LSVSFEQVLP PSAPTSEAGL SLPLRGPRAR RSASPHDVDL CLVSPCEFEH 360 

RKAVPMAPAP ASPGSSNDSS ARSQERAGGL GAEETPPTSV SESLPTLSDS DPVPLAPGAA 420 

DSDEDTEGFG VPRHDPLPDP LKVPPPLPDP SSICMVDPEM LPPKTARQTE NVSRTRKPLA 480 

RPNSRAAAPK ATPVAAAKTK GLAGGDRASR- PLSARSSPSE KGGRAPLSRK SSTPKTATRG 540 

PSGSASSRPG VSATPPKSPV YLDLAYIiPSG SSAHLVDEEF FQRVRALCYV ISGQDQRKEE 600 

GMRAVLDALL ASKQHWDRDL QVTLIPTFDS VAMHTWYAET HARHQALGIT VLGSNGMVSM 660 
QDDAFPACKV EF 



Seq ID NO: 214 DNA sequence 

Nucleic Acid Accession #: NM_002019.1 

Coding sequence: 250-4266 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GCGGACACTC CTCTCGGCTC CTCCCCGGCA GCGGCGGCGG CTCGGAGCGG GCTCCGGGGC 60 
TCGGGTGCAG CGGCCAGCGG GCCTGGCGGC GAGGATTACC CGGGGAAGTG GTTGTCTCCT 120 
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GGCTGGAGCC GCGAGACGGG CGCTCAGGGC 
GACTCTGGCG GCCGGGTCGT TGGCCGGGGG 
GCGCTCAC CA TGG TCAGCTA CTGGGACACC 
CTGCTTCTCA CAGGATCTAG TTCAGGTTCA 
GGCACCCAGC ACATCATGCA AGCAGGCCAG 
GCCCATAAAT GGTCTTTGCC TGAAATGGTG 
AAATCTGCCT GTGGAAGAAA TGGCAAACAA 
CAAGCAAACC ACACTGGCTT CTACAGCTGC 
AAGGAAACAG AATCTGCAAT CTATATATTT 
ATGTACAGTG AAATCCCCGA AATTATACAC 
TGCCGGGTTA CGTCACCTAA CATCACTGTT 
ATCCCTGATG GAAAACGCAT AATCTGGGAC 
ACGTACAAAG AAATAGGGCT TCTGACCTGT 
ACAAACTATC TCACACATCG ACAAACCAAT 
CGCCCAGTCA AATTACTTAG AGGCCATACT 
TTGAACACGA GAGTTCAAAT GACCTGGAGT 
GTAAGGCGAC GAATTGACCA AAGCAATTCC 
ATTGACAAAA TGCAGAACAA AGACAAAGGA 
TCATTCAAAT CTGTTAACAC CTCAGTGCAT 
CATCGAAAAC AGCAGGTGCT TGAAACCGTA 
AAAGTGAAGG CATTTCCCTC GCCGGAAGTT 
GAGAAATCTG CTCGCTATTT GACTCGTGGC 
GAGGATGCAG GGAATTATAC AATCTTGCTG 
CTCACTGCCA CTCTAATTGT CAATGTGAAA 
TTTCCAGACC CGGCTCTCTA CCCACTGGGC 
GGTATCCCTC AACCTACAAT CAAGTGGTTC 
GCAAGGTGTG ACTTTTGTTC CAATAATGAA 
ATGGGAAACA GAATTGAGAG CATCACTCAG 
ATGGCTAGCA CCTTGGTTGT GGCTGACTCT 
TCCAATAAAG TTGGGACTGT GGGAAGAAAC 
GGGTTTCATG TTAACTTGGA AAAAATGCCG 
ACAGTTAACA AGTTCTTATA CAGAGACGTT 
AGAACAATGC ACTACAGTAT TAGCAAGCAA 
ACTCTTAATC TTACCATCAT GAATGTTTCC 
GCCAGGAATG TATACACAGG GGAAGAAATC 
CAGGAAGCAC CATACCTCCT GCGAAACCTC 
ACCACTTTAG ACTGTCATGC TAATGGTGTC 
AACCACAAAA TACAACAAGA GCCTGGAATT 
ATTGAAAGAG TCACAGAAGA GGATGAAGGT 
GGCTCTGTGG AAAGTTCAGC ATACCTCACT 
GAGCTGATCA CTCTAACATG CACCTGTGTG 
CTCCTTATCC GAAAAATGAA AAGGTCTTCT 
ATAATGGACC CAGATGAAGT TCCTTTGGAT 
AGCAAGTGGG AGTTTGCCCG GGAGAGACTT 
TTTGGAAAAG TGGTTCAAGC ATCAGCATTT 
GTGGCTGTGA AAATGCTGAA AGAGGGGGCC 
GAGCTAAAAA TCTTGACCCA CATTGGCCAC 
TGCACCAAGC AAGGAGGGCC TCTGATGGTG 
TCCAACTACC TCAAGAGCAA ACGTGACTTA 
ATGGAGCCTA AGAAAGAAAA AATGGAGCCA 
GATAGCGTCA CCAGCAGCGA AAGCTTTGCG 
AGTGATGTTG AGGAAGAGGA GGATTCTGAC 
GATCTGATTT CTTACAGTTT TCAAGTGGCC 
TGCATTCATC GGGACCTGGC AGCGAGAAAC 
ATTTGTGATT TTGGCCTTGC CCGGGATATT 
GATACTCGAC TTCCTCTGAA ATGGATGGCT 
ACCAAGAGCG ACGTGTGGTC TTACGGAGTA 
TCTCCATACC CAGGAGTACA AATGGATGAG 
AGGATGAGAG CTCCTGAGTA CTCTACTCCT 
CACAGAGACC CAAAAGAAAG GCCAAGATTT 
CTTCAAGCAA ATGTACAACA GGATGGTAAA 
GGAAATAGTG GGTTTACATA CTCAACTCCT 
ATTTCAGCTC CGAAGTTTAA TTCAGGAAGC 
AAGTTCATGA GCCTGGAAAG AATCAAAACC 
ATGTTTGATG ACTACCAGGG CGACAGCAGC 
TTCACCTGGA CTGACAGCAA ACCCAAGGCC 
AAAAGTAAGG AGTCGGGGCT GTCTGATGTC 
GGGCACGTCA GCGAAGGCAA GCGCAGGTTC 
ATCGCGTGCT GCTCCCCGCC CCCAGACTAC 
ATC TAGA GTT TGACACGAAG CCTTATTTCT 
AACTAGCTTT TGCCAGTATT ATGCATATAT 
CAGCTGCTTT TTGTGATTTT TTTAATAGTG 
TCCAGATAGA GAAATAGTGA CAAGTGAAGA 
TGTTAGAGAA ATCCTTCCTA AACCCAATGA 
GCACGCAGGA CCAGTTTGAT TGAGGAGCTG 
CTGGGCCAGC CCTGCAGCCC AAAACCCAGG 



GCGGGGCCGG CGGCGGCGAA CGAGAGGACG 180 

AGCGCGGGCA CCGGGCGAGC AGGCCGCGTC 240 

GGGGTCCTGC TGTGCGCGCT GCTCAGCTGT 300 

AAATTAAAAG ATCCTGAACT GAGTTTAAAA 360 

ACACTGCATC TCCAATGCAG GGGGGAAGCA 420 

AGTAAGGAAA GCGAAAGGCT GAGCATAACT 480 

TTCTGCAGTA CTTTAACCTT GAACACAGCT 540 

AAATATCTAG CTGTACCTAC TTCAAAGAAG 600 

ATTAGTGATA CAGGTAGACC TTTCGTAGAG 660 

ATGACTGAAG GAAGGGAGCT CGTCA TTCC C 720 

ACTTTAAAAA AGTTTCCACT TGACACTTTG 780 

AGTAGAAAGG GCTTCATCAT ATCAAATGCA 840 

GAAGCAACAG TCAATGGGCA TTTGTATAAG 900 

ACAATCATAG ATGTCCAAAT AAGCACACCA 960 

CTTGTCCTCA ATTGTACTGC TACCACTCCC 1020 

TACCCTGATG AAAAAAATAA GAGAGCTTCC 1080 

CATGCCAACA TATTCTACAG TGTTCTTACT 1140 

CTTTATACTT GTCGTGTAAG GAGTGGACCA 1200 

ATATATGATA AAGCATTCAT CACTGTGAAA 1260 

GCTGGCAAGC GGTCTTACCG GCTCTCTATG 1320 

GTATGGTTAA AAGATGGGTT ACCTGCGACT 1380 

TACTCGTTAA TTATCAAGGA CGTAACTGAA 1440 

AGCATAAAAC AGTCAAATGT GTTTAAAAAC 1500 

CCCCAGATTT ACGAAAAGGC CGTGTCATCG 1560 

AGCAGACAAA TGCTGACTTG TACCGCATAT 1620 

TGGCACCCCT GTAACCATAA TCATTCCGAA 1680 

GAGTCCTTTA TCCTGGATGC TGACAGCAAC 1740 

CGCATGGCAA TAATAGAAGG AAAGAATAAG 1800 

AGAATTTCTG GAATCTACAT TTGCATAGCT 1860 

ATAAGCTTTT ATATCACAGA TGTGCCAAAT 1920 

ACGGAAGGAG AGGACCTGAA ACTGTCTTGC 1980 

ACTTGGATTT TACTGCGGAC AGTTAATAAC 2040 

AAAATGGCCA TCACTAAGGA GCACTCCATC 2100 

CTGCAAGATT CAGGCACCTA TGCCTGCAGA 2160 

CTCCAGAAGA AAGAAATTAC AATCAGAGAT 2220 

AGTGATCACA CAGTGGCCAT CAGCAGTTCC 2280 

CCCGAGCCTC AGATCACTTG GTTTAAAAAC 2340 

ATTTTAGGAC CAGGAAGCAG CACGCTGTTT 2400 

GTCTATCACT GCAAAGCCAC CAACCAGAAG 2460 

GTTCAAGGAA CCTCGGACAA GTCTAATCTG 2520 

GCTGCGACTC TCTTCTGGCT CCTATTAACC 2580 

TCTGAAATAA AGACTGACTA CCTATCAATT 2640 

GAGCAGTGTG AGCGGCTCCC TTATGATGCC 2700 

AAACTGGGCA AATCACTTGG AAGAGGGGCT 2760 

GGCATTAAGA AATCACCTAC GTGCCGGACT 2820 

ACGGCCAGCG AGTACAAAGC TCTGATGACT 2880 

CATCTGAACG TGGTTAACCT GCTGGGAGCC 2940 

ATTGTTGAAT ACTGCAAATA TGGAAATCTC 3000 

TTTTTTCTCA ACAAGGATGC AGCACTACAC 3060 

GGCCTGGAAC AAGGCAAGAA ACCAAGACTA 3120 

AGCTCCGGCT TTCAGGAAGA TAAAAGTCTG 3180 

GGTTTCTACA AGGAGCCCAT CACTATGGAA 3240 

AGAGGCATGG AGTTCCTGTC TTCCAGAAAG 3300 

ATTCTTTTAT CTGAGAACAA CGTGGTGAAG 3360 

TATAAGAACC CCGATTATGT GAGAAAAGGA 3420 

CCCGAATCTA TCTTTGACAA AATCTACAGC 3480 

TTGCTGTGGG AAATCTTCTC CTTAGGTGGG 3540 

GACTTTTGCA GTCGCCTGAG GGAAGGCATG 3600 

GAAATCTATC AGATCATGCT GGACTGCTGG 3660 

GCAGAACTTG TGGAAAAACT AGGTGATTTG 3720 

GACTACATCC CAATCAATGC CATACTGACA 3780 

GCCTTCTCTG AGGACTTCTT CAAGGAAAGT 3840 

TCTGATGATG TCAGATATGT AAATGCTTTC 3900 

TTTGAAGAAC TTTTACCGAA TGCCACCTCC 3960 

ACTCTGTTGG CCTCTCCCAT GCTGAAGCGC 4020 

TCGCTCAAGA TTGACTTGAG AGTAACCAGT 4080 

AGCAGGCCCA GTTTCTGCCA TTCCAGCTGT 4140 

ACCTACGACC ACGCTGAGCT GGAAAGGAAA 4200 

AACTCGGTGG TCCTGTACTC CACCCCACCC 4260 

AGAAGCACAT GTGTATTTAT ACCCCCAGGA 4320 

AAGTTTACAC CTTTATCTTT CCATGGGAGC 4380 

CTTTTTTTTT TTGACTAACA AGAATGTAAC 4440 

ACACTACTGC TAAATCCTCA TGTTACTCAG 4500 

CTTCCCTGCT CCAACCCCCG CCACCTCAGG 4560 

CACTGATCAC CCAATGCATC ACGTACCCCA 4620 

GCAACAAGCC CGTTAGCCCC AGGGGATCAC 4680 
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TGGCTGGCCT GAGCAACATC TCGGGAGTCC TCTAGCAGGC CTAAGACATG TGAGGAGGAA 4740 

AAGGAAAAAA AGCAAAAAGC AAGGGAGAAA AGAGAAACCG GGAGAAGGCA TGAGAAAGAA 4800 

TTTGAGACGC ACCATGTGGG CACGGAGGGG GACGGGGCTC AGCAATGCCA TTTCAGTGGC 4860 

TTCCCAGCTC TGACCCTTCT ACATTTGAGG GCCCAGCCAG GAGCAGATGG ACAGCGATGA 4920 

5 GGGGACATTT TCTGGATTCT GGGAGGCAAG AAAAGGACAA ATATCTTTTT TGGAACTAAA 4980 

GCAAATTTTA GACCTTTACC TATGGAAGTG GTTCTATGTC CATTCTCATT CGTGGCATGT 5040 

TTTGATTTGT AGCACTGAGG GTGGCACTCA ACTCTGAGCC CATACTTTTG GCTCCTCTAG 5100 

TAAGATGCAC TGAAAACTTA GCCAGAGTTA GGTTGTCTCC AGGCCATGAT GGCCTTACAC 5160 

TGAAAATGTC ACATTCTATT TTGGGTATTA ATATATAGTC CAGACACTTA ACTCAATTTC 5220 

10 TTGGTATTAT TCTGTTTTGC ACAGTTAGTT GTGAAAGAAA GCTGAGAAGA ATGAAAATGC 5280 

AGTCCTGAGG AGAGTTTTCT CCATATCAAA ACGAGGGCTG ATGGAGGAAA AAGGTCAATA 5340 

AGGTCAAGGG AAGACCCCGT CTCTATACCA ACCAAACCAA TTCACCAACA CAGTTGGGAC 54 00 

CCAAAACACA GGAAGTCAGT CACGTTTCCT TTTCATTTAA TGGGGATTCC ACTATCTCAC 5460 

ACTAATCTGA AAGGATGTGG AAGAGCATTA GCTGGCGCAT ATTAAGCACT TTAAGCTCCT 5520 

15 TGAGTAAAAA GGTGGTATGT AATTTATGCA AGGTATTTCT CCAGTTGGGA CTCAGGATAT 5580 

TAGTTAATGA GCCATCACTA GAAGAAAAGC CCATTTTCAA CTGCTTTGAA ACTTGCCTGG 5640 

GGTCTGAGCA TGATGGGAAT AGGGAGACAG GGTAGGAAAG GGCGCCTACT CTTCAGGGTC 5700 

TAAAGATCAA GTGGGCCTTG GATCGCTAAG CTGGCTCTGT TTGATGCTAT TTATGCAAGT 5760 

TAGGGTCTAT GTATTTAGGA TGCGCCTACT CTTCAGGGTC TAAAGATCAA GTGGGCCTTG 5820 

20 GATCGCTAAG CTGGCTCTGT TTGATGCTAT TTATGCAAGT TAGGGTCTAT GTATTTAGGA 5880 

TGTCTGCACC TTCTGCAGCC AGTCAGAAGC TGGAGAGGCA ACAGTGGATT GCTGCTTCTT 5940 

GGGGAGAAGA GTATGCTTCC TTTTATCCAT GTAATTTAAC TGTAGAACCT GAGCTCTAAG 6000 

TAACCGAAGA ATGTATGCCT CTGTTCTTAT GTGCCACATC CTTGTTTAAA GGCTCTCTGT 6060 

ATGAAGAGAT GGGACCGTCA TCAGCACATT CCCTAGTGAG CCTACTGGCT CCTGGCAGCG 6120 

25 GCTTTTGTGG AAGACTCACT AGCCAGAAGA GAGGAGTGGG ACAGTCCTCT CCACCAAGAT 6180 

CTAAATCCAA ACAAAAGCAG GCTAGAGCCA GAAGAGAGGA CAAATCTTTG TTGTTCCTCT 6240 

TCTTTACACA TACGCAAACC ACCTGTGACA GCTGGCAATT TTATAAATCA GGTAACTGGA 6300 

AGGAGGTTAA ACTCAGAAAA AAGAAGACCT CAGTCAATTC TCTACTTTTT TTTTTTTTTT 6360 

TCCAAATCAG ATAATAGCCC AGCAAATAGT GATAACAAAT AAAACCTTAG CTGTTCATGT 6420 

30 CTTGATTTCA ATAATTAATT CTTAATCATT AAGAGACCAT AATAAATACT CCTTTTCAAG 6480 

AGAAAAGCAA AACCATTAGA ATTGTTACTC AGCTCCTTCA AACTCAGGTT TGTAGCATAC 6540 

ATGAGTCCAT CCATCAGTCA AAGAATGGTT CCATCTGGAG TCTTAATGTA GAAAGAAAAA 6600 

TGGAGACTTG TAATAATGAG CTAGTTACAA AGTGCTTGTT CATTAAAATA GCACTGAAAA 6660 

TTGAAACATG AATTAACTGA TAATATTCCA ATCATTTGCC ATTTATGACA AAAATGGTTG 6720 

35 GCACTAACAA AGAACGAGCA CTTGCTTTCA GAGTTTCTGA GATAATGTAC GTGGAACAGT 6780 

CTGGGTGGAA TGGGGCTGAA ACCATGTGCA AGTCTGTGTC TTGTCAGTCC AAGAAGTGAC 6840 . 

ACCGAGATGT TAATTTTAGG GACCCGTGCC TTGTTTCCTA GCCCACAAGA ATGCAAACAT 6900 

CAAACAGATA CTCGCTAGCC TCATTTAAAT TGATTAAAGG AGGAGTGCAT CTTTGGCCGA 6960 

CAGTGGTGTA ACTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGGGTGTG 7020 

40 GGTGTATGTG TGTTTTGTGC ATAACTATTT AAGGAAACTG GAATTTTAAA GTTACTTTTA 7080 

TACAAACCAA GAATATATGC TACAGATATA AGACAGACAT GGTTTGGTCC TATATTTCTA 7140 

GTCATGATGA ATGTATTTTG TAT AC CATC T TCATATAATA TACTTAAAAA TATTTCTTAA 7200 

TTGGGATTTG TAATCGTACC AACTTAATTG ATAAACTTGG CAACTGCTTT TATGTTCTGT 7260 

CTCCTTCCAT AAATTTTTCA AAATACTAAT TCAACAAAGA AAAAGCTCTT TTTTTTCCTA 7320 

45 AAATAAACTC AAATTTATCC TTGTTTAGAG CAGAGAAAAA TTAAGAAAAA CTTTGAAATG 7380 

GTCTCAAAAA ATTGCTAAAT ATTTTCAATG GAAAACTAAA TGTTAGTTTA GCTGATTGTA 7440 

TGGGGTTTTC GAACCTTTCA CTTTTTGTTT GTTTTACCTA TTTCACAACT GTGTAAATTG 7500 

CCAATAATTC CTGTCCATGA AAATGCAAAT TATCCAGTGT AGATATATTT GACCATCACC 7560 

CTATGGATAT TGGCTAGTTT TGCCTTTATT AAGCAAATTC ATTTCAGCCT GAATGTCTGC 7620 

50 CTATATATTC TCTGCTCTTT GTATTCTCCT TTGAACCCGT TAAAACATCC TGTGGCACTC 



55 



Seq ID NO: 215 Protein sequence : 
Protein Accession #: NP_002010.1 



1 11 21 31 41 51 

I I I I 1 1 

MVSYWDTGVL LCALLSCLLL TGSSSGSKLK DPELSLKGTQ HIMQAGQTLH LQCRGEAAHK 60 

WSLPEMVSKE SERLSITKSA CGRNGKQFCS TLTLNTAOAN HTGFYSCKYL AVPTSKKKET 120 

60 ESAIYIFISD TGRPFVEMYS EIPEIIHMTE GRELVIPCRV TSPNITVTLK KFPLDTLIPD 180 

GKRIIWDSRK GFIISNATYK EIGLLTCEAT VNGHLYKTNY LTHRQTNTII DVQISTPRPV 240 

KLLRGHTLVL NCTATTPLNT RVQMTWSYPD EKNKKASVRR RIDQSNSHAN IFYSVLTIDK 300 

MQNKDKGLYT CRVRSGPSFK SVNTSVHIYD KAFITVKHRK QQVLETVAGK RSYRLSMKVK 360 

AFPSPEWWL KDGLPATEKS ARYIjTRGYSIj IIKDVTEEDA GNYTILLSIK QSNVFKNLTA 420 

65 TLIVNVKPQI YEKAVSSFPD PALYPLGSRQ ILTCTAYGIP QPTIKWFWHP CNHNHSEARC 480 

DFCSNNEESF ILDADSNMGN RIESITQRMA IIEGKNKMAS TLWADSRIS GIYICIASNK 540 

VGTVGRNISF YITDVPNGFH VNLEKMPTEG EDLKLSCTVN KFLYRDVTWI LfcRTVNNRTM 600 

HYSISKQKMA ITKEHSITLN LTIMNVSLQD SGTYACRARN VYTGEEILQK KEITIRDQEA 660 

PYLLRNLSDH TVAISSSTTL DCHANGVPEP QITWFKNNHK IQQEPGIILG PGSSTLFIER 720 

70 VTEEDEGVYH CKATNQKGSV ESSAYLTVQG TSDKSNLELI TLTCTCVAAT LFWLLrLTLLI 780 

RKMKRSSSEI KTDYLSIIMD PDEVPLDEQC ERLPYDASKW EFARERLKLG KSLGRGAFGK 840 

WQASAFGIK KSPTCRTVAV KMLKEGATAS EYKALMTELK ILTHIGHHLN VVNLLGACTK 900 

QGGPLMVIVE YCKYGNLSNY LKSKRDLFFL NKDAALHMEP KKEKMEPGLE QGKKPRLDSV 960 

TSSESFASSG FQEDKSLSDV EEEEDSDGFY KEPITMEDLI SYSFQVARCH4 EFLSSRKCIH 1020 

75 RDLAARNILL SENNWKICD FGLARDIYKN PDYVRKGDTR LPLKWMAPES IFDKIYSTKS 1080 

DVWSYGVLIiW EIFSLGGSPY PGVQMDEDFC SRLREGMRMR APEYSTPEIY QIMLDCWHRD 1140 
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PKERPRFAEL VEKLGDLLQA NVQQDGKDYI PINAILTGNS GFTYSTPAFS EDFPKESISA 1200 

PKFNSGSSDD VRYVNAFKFM SLBRIKTFEE LLPNATSMFD DYQGDSSTLL ASPMLKRFTW 1260 

TDSKPKASLK IDLRVTSKSK ESGLSDVSRP SFCHSSCGHV SEGKRRFTYD HAELERKIAC 1320 
CSPPPDYNSV VLYSTPPI 



Seq ID NO: 216 DNA sequence 
Nucleic Acid Accession #: 
Coding sequence: 76-624 



NM_024689 

(underlined sequences correspond to start and stop codons) 



1 
I 

CTCTTTGGCC 
AGCCCAGAGC 
TGGCTGGCCC 
TCCCTTTCTT 
GGTCTTGATA 
GAAATAAGAT 
TCTTATCCTG 
CTGGTCAGCA 
CCAAAGACCT 
CAGGCCAAGC 
AAGTTCCTGT 
AGAACTTCCA 
AGTAATCATT 
AGAATTCACC 
GACAACTCCA 
TGAGCCCACC 
CTGGACAGGT 
TTACCAAAGG 
CCCTAGTGAT 
CTTTTACTTT 
TTTATGTTTC 
TTTTTTGAGC 
GGGGAAGCTA 
TAGATTTCGG 
GTGTATCAGG 
AGATTTATTT 
TGGTAGGCCT 
ATTCCTTGTT 
GCCCACCCAG 
TCACATCTAT 
GAATACAGCC 
TTTTATCGAC 
AGCATCACCC 
CAATAGAGAC 
AGGATGAGGA 
TTTTTATTTT 
AAGTGATTTA 
ATGAAACCTT 
TAATTGTGGG 
TGTAAGTGGG 
GTCTCTTAAG 
TAGCAGTTAT 



11 

I 

AAGCCCTGCC 
CCAAGATGGA 
TGCTGCTGTG 
CTCTGGTGCC 
AATGCAATGC 
CTGACAACTG 
CAAATTACTC 
AATATCAAAA 
GCAGCATTGA 
GCCTCACGCC 
GTATGCTGAG 
GAAAGTGTTA 
AAAGAGGCTT 
AACACACAGG 
AAGCCCCGGC 
CCAATCCAGA 
CTTCCCTATG 
CCCACATAAC 
GGATGAACTC 
TTAAGTACCT 
CATTCTGGTA 
AAACACTCGG 
CTTCTCCTCT 
TCTTCATTGC 
CAGGGTTCTA 
CAAAGAATTG 
GCAATCTGTA 
CCTTAAAAAA 
ATTACCTAGA 
GAAATGCCTT 
TAGCCAAGTT 
CGTCTTCAGA 
TGAACCAAAG 
ATTGACTGGT 
AACCAGGCAC 
TCACTGGGAG 
GAAACTCCAA 
ATTTTATTGG 
TTTGCACATG 
ACCTTGGGGA 
CCTGTTCCTG 
CTATTGTTGT 



21 



TCTGTACAGC 
GCCCCAGCTG 
GGTCTCAGCC 
CCAAGTCAGA 
CTGCATCGGG 
GCTGGCTTCC 
AGATGATTCC 
CGAGATCTCA 
GCGTGTCCTG 
GGACCTGGTG 
ATAACACCAG 
GCCTTCTCCC 
CTGCATCAAA 
CCCACCAGCA 
TCTTTCCACC 
TGTGATCCCC 
AGATAGAACC 
TTCTAAATTT 
TCTTATCTCT 
CCATCAGAGT 
AGAACTCTTT 
GGGTATGGAT 
ATTCAGATTT 
TGTCCATTTT 
CCAGAGAAAC 
ATTTACATGA 
AACCTTTGGG 
ATCTGTTTTT 
TAATCTCTTT 
CACAGCAACA 
GACACATAAA 
CTGTTAAGGA 
GCCCCTATCA 
TGGCTGGCTT 
GGGAGAGGGA 
GTGGTAAGTT 
AGCAATTGGT 
AAATGGTTGG 
GCCAGCACAT 
GGAGCTGCCT 
CTACAGTTAT 
GTATTAAACC 



31 
I 

CTCGAGTGGA 
GGGCCTGAGG 
CTGAGCTGTT 
ACCAGCTACA 
ACATCTATTT 
CACCTTGGAC 
AAAATCTGGC 
GACAGGAAAA 
CGGAAAACAG 
CAGGACTGTC 
TGAAAAAGCC 
AACTGTGTTA 
CCTTCACATG 
ACAGGCTACC 
ACACTGTGGT 
CTGTGATCTA 
TGATAAGGAG 
TGGTCTGGTC 
GGCTTCTAGA 
CATGAAATCA 
AAATGAGGAC 
GAAAGCCAAT 
CACTAAAATC 
TGTAATGAAC 
AGAACCAGTA 
TTGTGGGGAT 
CAGGAGCTGA 
GTTCTTAAGG 
TACTTAAAGT 
CCTAGATTAG 
ATT AAC CATC 
TTGTGGTAGA 
AGTAACAATA 
CCCAAGGGAT 
GGGGCAACAG 
AGCCCTGTTG 
AATCCCCAAA 
TTTCCCAATT 
GCCAAACAGA 
CCATCATAAA 
AGAGGTTGCT 
ATTTCAACAC 



41 
I 

CAGCCAGAGG 
CTGCCGCCCT 
CTTTCTCCTT 
ATTTTGGAAG 
GCAAGAAGTT 
TGCCTCCCGA 
GCCCTGTGGA 
TCTGTGCCTC 
AGAGGTTCCA 
ACCAGGGCCA 
TGGCATGGAG 
TACCAACCAC 
CAGCTCCCAT 
TTTGCACAAT 
CCCCTAGATG 
CTTCTGGCAA 
CTAGGGCAAT 
TGAAGGAAAA 
GGGAAAAAAA 
CCTGTCAAGA 
ACTGCTGATT 
CGCAGGTCAA 
TTCCAAGATG 
GAGTGTTTTT 
GGAGATACAT 
TGGCAAGTCG 
TGCTGTAGTT 
GCTTTGAATG 
AAACTGATTG 
CATTCAATTG 
ACAGCAACAT 
GAACTGTGAC 
TAGCCAAGCA 
AGCACCAGAC 
AGGTCCAGGG 
CCCATGTATG 
ATGGGTGTAT 
CTGTTTGCCA 
AGTAGACAAA 
GGGAGGGGTT 
CAGAACCTTC 
AT 



51 
I 

CTGCAGCTGG 
CCGCCCTGGC 
GCCAGCTTCT 
GACTTTCCTC 
CTTTAAAGAA 
TTCCTTGCTT 
GATCTTTAGA 
TGCATCAGCC 
GAAATGGCTG 
GAGAGAACTA 
CCCAGCACTG 
ATTTTCAAAT 
GCCACCCTCC 
ATTCTCTGAT 
GGGCTGTTGC 
GATTCTCAGT 
TCTGACAACA 
CCTGTTCTCG 
AAGCATACCT 
CTATCTATCT 
GCTGGTGATG 
ATGACTCCTT 
AAAGCAAATC 
CCTTTAGCTA 
ATACATGTCC 
AAAATCCATA 
TGCAGATAGA 
ATTGGATCAG 
TAGGTGCTAA 
AATAACTGGG 
GCCTGCTAAA 
AGCCACTCTC 
AAATTCCAGT 
AAGAAATGCA 
TTTGGTTATC 
CAGATGGGAG 
CTGGTTTGAA 
TTGGCCAATA 
GGTCTCACTC 
AGTAAAAATG 
TCAGCAAATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



Seq ID NO: 217 Protein sequence : 
Protein Accession #: NP 078965.1 



1 11 21 31 41 51 

I I I I I I 

MEPQLGPEAA AIiRPGWLAIdj LWVSALSCSF SLPASSLSSL VPQVRTSYNF GRTFLGLDKC 
NACIGTSICK KFFKEEIRSD NWLASHLGLP PDSLLSYPAN YSDDSKIWRP VEIFRLVSKY 
QNEISDRKIC ASASAPKTCS IERVLRKTER FQKWLQAKRL TPDLVQDCHQ GQREIiKFLCM 
LR 



60 
120 
180 



Seq ID NO: 218 DNA sequence 

Nucleic Acid Accession #: AF075027.1 

Coding sequence: 3-269 (underlined sequences correspond to start and stop codons) 



GAT TAA TTAA 
TGGGATGGCT 
CTTTCTGTGC 
CTTTCTCCCT 
CCATAGATCT 
GACAAAAGTG 



11 
I 

GTGCTTTAAA 
GTAGCTTGAG 
CAAGGTGCTG 
GGAGTAAGGC 
CATGCATCAT 
GTGGGTTTTC 



21 



31 



CGGTCTTGGT AAATATTCCG 
TTGAATTTTA ACTGTCCTCA 
TGTTACGGGA GAGAGTGACT 
CGAAGACTGG ATTACTACAC 
T AATG C CATA_TG ACATTGCC 
ATTGTCTTCA CTGATTGTCA 



41 
I 

CGGGAGCTGG 
TTCTGGGTTT 
GGAAAGTAAC 
GCCTAGACGT 
ATTTTCTTTC 
ATGCATTAAT 



51 
I 

GGAGGACCGT 60 

TGTCGCTCTG 120 

AAAGCTGAAT 180 

GACACTACAC 240 

TCAGTTCACG 300 

AAAGAAGATG 360 
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TGTGGT 

Seq ID NO: 219 Protein sequence! 
Protein Accession #: AP075027 



11 



21 



31 



41 



51 



ERKWQCHMAL MMHEIYGCSV TSRRWIQSS ALLQGERFSP VTFQSLSPVT QHLGTESRAT 
KPRMRTVKIQ LKLQPSQRSS PAPAEYLPRP FKALN 



60 



Seq ID NO: 220 DNA sequence 

Nucleic Acid Accession #: AL133411.8 

Coding sequence: 1-1395 (underlined sequences correspond to start and stop codons) 



ATGGGCAAGG 
AAATGGGATC 
AACAGGCAAC 
ACATCCAGAA* 
AAAAAGGACA 
AACCAAACAC 
CAGGGAGGGG 
CTTGAACATG 
TGGGCCCAGG 
ATCTGGGTGG 
CTACCTTTAA 
TATGCTCCTG 
ACAACACAAA 
AATTTTGCTC 
GAAGAAGAAA 
GTGCCTGCAT 
GATAAAGATC 
AATCAGCCAG 
CTCCTCCTCT 
CTGAGTTATA 
TACTTTCATC 
ACATTTTTGG 
AAGATAATGA 
TCGGTTACCC 



11 
I 

ACTTCATGAC 
TAATCAAATT 
CTACAGAATG 
TCTATAAGGA 
TGGATGAAGC 
CACATGTTCT 
AACATCACAC 
CTGGGAGGAA 
AACAATCCAT 
CACAAACTCC 
ATGAAGATGT 
CTAATGAGAA 
ATCCAAATGG 
TAAAAAACGA 
CAACTACTAG 
TTTGGACAAT 
AATTATTTCA 
ATCTAGAGGA 
TTGTGGTCCT 
AAAGTTGTGA 
CATCAGAAGG 
GTACCACTTC 
CGGATATCAT 
GGTGA 



21 
I 

TAAAACACTA 
AAAGAGCTTC 
GGAGAAAAAT 
ACTTAAACAA 
TGGAAACCGT 
CACTCATAAG 
ACTGGGGCCT 
ATTACAATTC. 
AATACAAAAG 
TGGTGAATCT 
TAATAAGCAG 
AAATGGCAAT 
CACTGAGTCT 
TAAAACTGTC 
CGAACCCTCT 
GTTAGCTAAA 
CCCAATTCCA 
TCTGAAGATC 
CTTGGCATTC 
GAGTCAGTAC 
TGTTTCAGAT 
TTCAGATATG 
TTCCATAGGC 



31 
I 

AAAGCAATGG 
CGCACAGCAA 
TTTGCAATGT 
TTTTACAAGA 
CATTCTCAGA 
TGGGAGTTGA 
GTCAGAAGCC 
ATCCATGGGC 
AAATATGCAT 
ATCTCCAGTT 
GAAGAAAAGA 
TATTATAAAG 
GAAATATCTG 
AATGCAACTA 
CATAAAAATA 
GCTATAAATG 
GAGTCTGATG 
AAAATAATGC 
TGTAGTGCTA 
TCTGTCAACC 
ACATCCTTTT 
AGAAGATCAG 
TCAGATAATG 



41 
I 

CAACAAAAGC 
AAGAAACTAT 
ATCCATCTGA 
AAAAACCAAA 
AAACTAACAC 
ACAATGAGAA 
CCTCTGGCCT 
TGTTTACCCT 
TATGGATTGG 
CACCAGCATT 
ATGAAGATCA 
ATATAAAACA 
TGAGAGCCAC 
CATATGAAAA 
TTCAAAGATC 
GAACAGCAGT 
TGAATGCTAC 
TGGGAATCTC 
CACTGTACAA 
CAGAGCTGGC 
CCAAGAGTGC 
GCACAAGAAC 
AGATGCATGA 



51 

I 

CAAAATTGAC 
TATCAGAGTG 
CAAAGGGCTG 
CAACGCCATC 
AGGAACAGAA 
CACATGGACA 
CCTGGCTGGC 
TGAAAATGAA 
AACCAAGCAG 
GCCTAATGTG 
TACTCCCAAT 
ATATGTGTTC 
AACTGACCTG 
ATCCACCATT 
AACCCCAAAC 
GGTCATGGAT 
ACAGGGAGAA 
GTTGATGACC 
ACTGAGGCAT 
CACGATGTCT 
AGAGAGCAGC 
ATCAGAATCT 
AAACGATGAG 



60 
120 
180 
24 0 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 221 Protein sequence: 
Protein Accession #: AL133411.8 



1 11 21 31 .41 51 

I I I I II 

MGKDFMTKTIi KAMATKAKID KWDLIKLKSF RTAKETIIRV NRQPTEWEKN FAMYPSDKGL 60 

TSRIYKELKQ FYKKKPNNAI KKDMDEAGNR HSQKTNTGTE NQTPHVLTHK WELNNENTWT 120 

QGGEHHTLGP VRSPSGLLAG LEHAGRKLQF IHGLFTLENE WAQEQSIIQK KYALWIGTKQ 180 

IWVAQTPGES ISSSPALPNV LPLNEDVNKQ EEKNEDHTPN YAPANEKNGN YYKDIKQYVF 240 

TTQNPNGTES EISVRATTDIi NFALKNDKTV NATTYEKSTI EEETTTSEPS HKNIQRSTPN 300 

VPAFWTMLAK AINGTAWMD DKDQLFHPIP. ESDVNATQGE NQPDLEDLKI KIMLGISLMT 360 

LLLFWLLAF CSATLYKLRH LSYKSCESQY SVNPELATMS YFHPSEGVSD TSFSKSAESS 420 
TFLGTTSSDM RRSGTRTSES KIMTDIISIG SDNEMHENDE SVTR 

Seq ID NO: 222 DNA sequence 

Nucleic Acid Accession #: AL050295.1 

Coding sequence: 237-2073 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GAAGGGGACA GAAGGCAGTT CACCTCTGCT CCCGACAGCC TGGGAACCCG CAAGAGCCCC 60 

AGCATTTGAA GTCTGGTCTT GTGAAACCCC ACCCTCCTCT GGCTGTGTGA TTGAATGGGA 120 

TGCCCTCGAG GTACACCTCA CCTGAGAGGG TTTTGGGCAG ATCAGCAGTA AGGTGTTAAA 180 

TTTTAGAAGC CTGAAAACTC CAGAAGAGAA AGGCCAACCA ACTCAAACTT GAAGA CATG A 240 

AATCCCCAAG GAGAACCACT TTGTGCCTCA TGTTTATTGT GATTTATTCT TCCAAAGCTG 300 

CACTGAACTG GAATTACGAG TCTACTATTC ATCCTTTGAG TCTTCATGAA CATGAACCAG 360 

CTGGTGAAGA GGCACTGAGG CAAAAACGAG CCGTTGCCAC AAAAAGTCCT ACGGCTGAAG 420 

AATACACTGT TAATATTGAG ATCAGTTTTG AAAATGCATC CTTCCTGGAT CCTATCAAAG 480 

CCTACTTGAA CAGCCTCAGT TTTCCAATTC ATGGGAATAA CACTGACCAA ATTACTGACA 540 

TTTTGAGCAT AAATGTGACA ACAGTCTGCA GACCTGCTGG AAATGAAATC TGGTGCTCCT 600 

GCGAGACAGG TTATGGGTGG CCTCGGGAAA GGTGTCTTCA CAATCTCATT TGTCAAGAGC 660 

GTGAGGTCTT CCTCCCAGGG CACCATTGCA GTTGCCTTAA AGAACTGCCT CCCAATGGAC 720 
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10 



15 



20 



25 



CTTTTTGCCT 
TTCAAGAAGA 
AAACAGCGTT 
GGTTCAAGTC 
AGTTAATACA 
ACTACAACTC 
TCATCTTTGA 
TGTCTTGGCG 
ACACCGCACT 
CAGGTGATGC 
AGAAGAAAAT 
GCGACAACAA 
TAGAATGGAA 
CTAGCTGCAG 
GAACAACAGT 
ACATAAAAGT 
TTTCTGAGGG 
TTTATTGGAA 
ATCTTGATGG 
CCTATCACTG 
TTCACCCGCT 
GCAGTGGTTC 
TCCATATGGG 



GCTTCAGGAA 
CCTCATGAAC 
CCGGAAGGGT 
TGGAAGTGTG 
TAAAGCCAAT 
CTTTCAAGCA 
AGGGGACACA 
CTATGAAGAA 
TTTCAACAAC 
AGGTGAATAT 
AGATGTTATG 
TCCTGTATCT 
GCAGGAAGGA 
CAGATACACC 
CATCTACACT 
GACATTCATC 
ACAAAACTTT 
CACTTCTGCT 
AGCAGAATCA 
CATATTTAGA 
GCCTCTAAAG 
CCATCACATC 
TTCCTCATCC 



GATGTTACCC 
ACTTCCTCCG 
TACGGAATTT 
GTTGTGACAT 
GAACAAGTTG 
GTTACTATCA 
GTCAGTCTGG 
CAGCAGTTGG 
ATGACTTCGG 
GTTTGCAAAC 
CCCATCCAAA 
TTGAACTGCT 
AAAATAAATA 
CTCAAGGCTG 
TGTGAGTTCA 
TCTGTGGCCA 
TCTATAAAAT 
GGAATTAAAA 
GTACTGACAG 
TATAAGAATT 
CTGAACATCA 
AAGTGCTGCA 
CTTCCTGCTG 



TGAACATGAG 
CCCTCTATAG 
TACCAGGCTT 
ATGAAGTCAA 
TACAGAGCCT 
ATGAAAGCAA 
TGTGTGAAAA 
AAATCCAGAA 
TGTCCAAGCT 
TGATATTAGA 
TTTTGGCAAA 
GCAGTCAGGG 
TTCCAGGAAC 
ATGGAACCCA 
TCAGTGCCTA 
ATCTAACAAT 
GCATCAGTGA 
TATACCAAAG 
TCAAGACCTC 
CATACAGTAT 
TGATTGATCC 
TAGAGGAGGA 
TAAAAAAAAA 



AGTCAGACTA 
GTCCTACAAG 
CAAGGGCGTG 
GACTACACCA 
CAATCAGACC 
TTTCTTTGTC 
GGAAGTTTTG 
CAGCAGCAGA 
CACCATCCAC 
CATTTTTGAA 
TGAAGAAATG 
TAATGTTAAT 
CCCTGAGACA 
GTGCCCAAGC 
TGGAGCCAGA 
AACCCCGGAC 
TGTGAGTAAC 
ATTTTATACC 
GACCAGGGAG 
TGCAACCAAA 
TTTGGAAGCT 
TGGAGACTAC 
AAAAAAAAAA 



AATGTAGGCT 
ACCGACTTGG 
ACTGTGACAG 
CCATCACTTG 
TACAAAATGG 
ACACCAGAAA 
TCCTCCAATG 
TTCTOGATTT 
AACATCACTC 
TATGAGTGCA 
AAGGTGATGT 
TGGAGCAAAG 
GACATAGATT 
GGGTCGTCTG 
GGCAGTGCAA 
CCAATTTCTG 
TATGATGAGG 
ACGAGGAGGT 
TGGAATGGAA 
GACGTCATTG 
ACTGTTTCAT 
AAAGTTACTT 
A 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 



Seg ID NO: 223 Protein sequence: 
Protein Accession #: CAB43394.1 



30 



35 



40 



MKSPRRTTLC 
EEYTVNIEIS 
SCETGYGWPR 
GFQEDLMNTS 
LELIHKANEQ 
NVSWRYEEQQ 
CKKKIDVMPI 
DSSCSRYTUC 
SVSEGQNFSI 
GTYHCIFRYK 
TPHMGSSSLP 



11 
I 

LMFIVIYSSK 
FENASFLDPI 
ERCLHNLICQ 
SALYRSYKTD 
WQSLNQTYK 
LEIQNSSRFS 
QILANEEMKV 
ADGTQCPSGS 
KCISDVSNYD 
NSYSIATKDV 
AVKKKKKK 



21 
I 

AALNWNYEST 
KAYLNSLSFP 
ERDVFLPGHH 
LETAFRKGYG 
MDYNSFQAVT 
IYTALFNNMT 
MCDNNPVSLN 
SGTTVIYTCE 
EVYWNTSAGI 
IVHPLPLKLN 



31 
I 

IHPLSLHEHE 
IHGNNTDQIT 
CSCLKELPPN 
ILPGFKGVTV 
INESNFFVTP 
SVSKLTIHNI 
CCSQGNVNWS 
FISAYGARGS 
KIYQRFYTTR 
IMIDPLEATV 



41 

I 

PAGEEALRQK 
DILSINVTTV 
GPFCLLQEDV 
TGFKSGSWV 
EIIFEGDTVS 
TPGDAGEYVC 
KVEWKQEGKI 
ANIKVTFISV 
RYLDGAESVL 
SCSGSHHIKC 



51 
I 

RAVATKSPTA 
CRPAGNEIWC 
TLNMRVRLNV 
TYEVKTTPPS 
LVCEKEVLSS 
KLILDIFEYE 
NIPGTPETDI 
ANLTITPDPI 
TVKTSTREWN 
CIEEDGDYKV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



45 



Seq ID NO: 224 DNA sequence 

Nucleic Acid Accession #: NMJ507268 

Coding sequence: 46-1245 (underlined sequences correspond to start and stop codons) 



11 



21 



31 



41 



51 



50 



55 



60 



65 



70 



75 



GGTAGCAGGA 
GGCCTGCTAC 
CCAGAGAGTG 
CTGCAAGGCT 
ACCATCTTTC 
CTGCATGTGA 
GATGACCGGA 
GTGAGAGATA 
ACAACTGGCA 
CAGGCTCGGG 
GAACCCATCA 
TCAGGCTCCT 
GTGAAGTTTG 
ACCATGACAT 
GACATGGATG 
GCCATCATCC 
CTCTGTCGGA 
GAGGCCAACG 
GATGAGCCAA 
GAGTACCAGA 
CTGGATTATG 
CCAGGATCTG 
GCTACCTCTC 
TCACTGGCTT 
GCTCTGGGCC 
GGGAAGATGC 
GCATCTTGCC 
TATCCAGGAT 
ACAGGCCAGG 



GGCTGGAAGA 
TCCTGGGGCA 
TAACAGGACC 
ACACCCAAGT 
TACGTGACTC 
GCCACAAGGT 
GCCACTACAC 
AGATTACTGA 
GCGGTTATGG 
GTTCTCCTCC 
AAGTAGCAAC 
ATTTCTGCAC 
TGGTCAAAGA 
ACCCCTTGAA 
GCTACCTTGG 
TCATCATCTC 
AGACATCCCA 
ACTCTGGAGA 
CTTCCCAGAA 
TCATCGCCCA 
AGTTTCTGGC 
CTGACATAAT 
TTCCTGGATA 
TGCCCTGGAA 
CTTCTAGTAT 
CCATAGCACT 
ACCAGAAGAC 
CATTTCTCTT 
GTTCAGTTCT 



AAGGACAGAA 
CCTAACAGTG 
TTGGAAAGGG 
CTTGGTGAAG 
TTCTGGAGAC 
TCCAGGAGAT 
GTGTGAAGTC 
GCTCCGTGTC 
CTTCACGGTG 
CATCAGTTAT 
CCTAAGTACC 
TGCCAAGGGC 
CTCCTCAAAG 
AGCAACATCT 
AGAGACCAGT 
CTTGTGCTGT 
ACAAGAGCAT 
AACCATGAGG 
TCTGGGCAAC 
GATCAATGGC 
CACTGAGGGC 
TGCCTAGTCA 
GCCCAAAGTG 
TTTGCCAGAT 
CTCTGCCGGG 
AGGACTTGGT 
CCGAGGGAGG 
TCTTCAGGGC 
GCTCCTCCAC 



GTAGCTCTGG 
GACACTTATG 
GATGTGAATC 
TGGCTGGTAC 
CATATCCAGC 
GTATCCCTCC 
ACCTGGCAGA 
CAGAAACTCT 
CCCCAGGGAA 
ATTTGGTATA 
TTACTCTTCA 
CAGGTTGGCT 
CTACTCAAGA 
ACAGTGAAGC 
GCTGGGCCAG 
ATGGTGGTTT 
GTCTACGAAG 
GTGGCCATCT 
AACTACTCTG 
AACTACGCCC 
AAAAGTGTCT 
GTCCTTGCCT 
TCCGCCTACC 
GCATCTCAAG 
GGCTTCTGGT 
CATCATGCCT 
CTCAGCTCTG 
CAGACAGCTT 
TATAAGTCTA 



CTGTGATGGG 
GCCGTCCCAT 
TTCCCTGCAC 
AACGTGGCTC 
AGGCAAAGTA 
AATTGAGCAC 
CTCCTGATGG 
CTGTCTCCAA 
TGAGGATTAG 
AGCAACAGAC 
AGCCTGCGGT 
CTGAGCAGCA 
CCAAGACTGA 
AGTCCTGGGA 
GAAAGAGCCT 
TTACCATGGC 
CAGCCAGGGC 
TCGCAAGTGG 
ATGAGCCCTG 
GCCTGCTGGA 
GTTAAAAATG 
TCTGCATGGC 
AACACTGGAG 
TAAGCCAGCT 
ACTCCTCTCT 
ACAGACACTA 
CCAGCTCAGA 
TTAATTGAAA 
ATGTTCTGAC 



GATCTTACTG 
CCTGGAAGTG 
CTATGACCCC 
AGACCCTGTC 
CCAGGGCCGC 
CCTGGAGATG 
CAACCAAGTC 
GCCCACAGTG 
CCTTCAATGC 
TAATAACCAG 
GATAGCCGAC 
CAGCGACATT 
GGCACCTACA 
CTGGACCACT 
GCCTGTCTTT 
CTATATCATG 
ACATGCCAGA 
CTGCTCCAGT 
CATAGGACAG 
CACAGTTCCT 
CCCCATTAGG 
CTTCTTCCCT 
CCGCTGGGAG 
GCTGGATTTG 
AAATACCAGA 
TTCAACTTTG 
GGACCAGCTA 
TTGTTATTTC 
TCTCTCCTGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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TGCTCAATAA ATATCTAATC ATAACAGCAA AAAAAAAAAA AAAAAAA 



Seq ID NO; 225 Protein sequence: 
Protein Accession #: NP_009199.1 



1 11 21 31 41 51 

I I I I I 1 

MGILLGIiLLL GHLTVDTYGR PILEVPESVT GPWKGDVNLP CTYDPLQGYT QVLVKWLVQR 60 

GSDPVTIFLR DSSGDHIQQA KYQGRLHVSH KVPGDVSLQL STLEMDDRSH YTCEVTWQTP 120 

DGNQWRDKI TELRVQKLSV SKPTVTTGSG YGFTVPQGMR ISLQCQARGS PPISYIWYKQ 180 

QTNNQEPIKV ATLSTLLFKP AVIADSGSYF CTAKGQVGSE QHSDIVKFW KDSSKXiLKTK 240 

TEAPTTMTYP LKATSTVKQS WDWTTDMDGY LGETSAGPGK SLPVFAIILI ISLCCMWFT 300 

MAYIMLCRKT SQQEHVYEAA RAHAREANDS GETMRVAIFA SGCSSDEPTS QNLGNNYSDE 360 
PCIGQEYQII AQINGNYARL LDTVPLDYEF LATEGKSVC 



Seq ID NO: 226 DNA sequence 

Nucleic Acid Accession #: XM_64321 

Coding sequence; 1-2079 {underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGGTCGCCA GTTCCGATCA AGACAGAGCC CCGTATCTTC CAGGGACACT AGACAAGATG 60 

CCAGGACCAC GCCTCCGCTC TGCCCAGAGG CCAAAAGCAG CCCAACAAGA GCCCGGCATT 120 

GAGCCTGGTA CTTACAGGGA GGGTGGTGGA GCCATCGTCC TCACGTATGC GCTGGGGATC 180 

GGGGTTGGGA TCACGGGAAA CACAGTTCAA CAACCACCTC AACTCACTGA CTCCGCCAGC 240 

ATCCGTCAGG AGGATGCCTT TGATAACAAA ATTGACATTG CTGAAGATGG TGGCCAGACA 300 

CCATACGAAG CTACCTTGCA GCAAAGCTTT CAATACTCAC CTACAACAGA TCTTCCTCCA 360 

CTCACAAATG GCTACCTGCC ATCAATCAGC ATGTATGAAA TTCAAACCAA ATACCAGTCG 420 

CATAATCAAT ATCCTAATGG AAATTCTAAA CAGAAGACCA CATTAAATTC TAGAAAACCC 4 80 

TTCCCCTCCA CAGCCACCAC TTCGGTACCA CAAACTGTGA TTCCAAAGAA GAGTGGCTCA 540 

CCTGAAGTTA AACTAAAAAT AACCAAAACT ATCCAGAATG GCAGGGAATT GTTCAAGTCT 600 

TCCCTTTGTG GAGACCTTTT AAATGAAGTA CAGGCAAGTG AGCACACGAA GTCAAAGCAT 660 

GAAAGCAGAA AAGAAAAGAG GAAAAAACCC AAAAAGCATG ACTCATCAAG ATCTGAAGAG 720 

CGCAAGTCAC ACAAAATCCC CAAATTAGAA CCAGAGGAAC AAAATAGACC AAATGAGAGG 780 

GTTCACACCA TATCAGAAAA ACCAAGGGAA GATCCAGTAC TAAAAGAGGA AGCCCCAGTT 840 . 

CAGCCAATAC TATCTTCTGT TCCAACAACA GAAGTGTCCA CTGGTGTTAA GTTTCAAGTT 900 

GGTGATCTTG TGTGGTCCAA GGTGACGGTC ACACCCTGTT GGGTGCCCCG CCTGCGAGGA 960 

CGGAGGAGCC ATCACTGTTC CAGCTGCCTG GAGATCTTGG TGCTGGTGCC AGCCCTCAGC 1020 

CTCAAGAGGT CTTTCATGGT TTCTTCCTTG AAGTTCCTCA CCTCCACGGG CAAACAGAAG 1080 

CCCACATTCA AGGGAACTGC CCAGATGGGC TGGTCACCTA TGGCCTCCAC GACCAATGTC 1140 

TCCCTGCTCC TTGGTCATTG GGAAGGAACA GACCAGATGT CATCCAGGGG CCCGGAATTT 1200 

GGGGGGCGCC GCTGGGTGTG GCAGCATCAG AAGCCTCAGA TCCGCATCTC CATCTGCCAC 1260 

AGGCCAGGGA AGGAACCTCT GAGACTCAGT TTCCTACGAT GTGAAGTGGA GAGAAGAATC 1320 

TCCTCTTTAG CCACCTCTCA GGGCTGCTGG TGTTCGCCCC CAGACCACGT CTGTGAGAAA 1380 

TGCTTAGAAG ACTATGCAGG GCGCCGCCAT TTGACACTCA GAGCCCAGGA AGCCTTTCTT 1440 

GGTCCAGACA GCAGGACTGG AAGCCTTAGA GCTGTCGGCA AGAGATACTG CAGGAACAGC 1500 

CAGCACCAGA GATATCTCCT GCAAGGCCTC CTAGGTGGGT TCTTGGAAGA AAGGAATGCC 1560 

AATGAATATG ATTGCAAGCT AGAGACGAGA GAAGCGGCGT CCTCAACTCC AAGAATCCCG 1620 

TATTCCCCAA CCCACATCCT TCAGTCT.GAA AGTGCCCCTA ACCACTACTT TCCCTACCAC 1680 

GTCTCCCTTT CCAAGTTCCT CAAACGCAAA GCAAACAGCC ATTTCCTGCA CCTGTGTGCA 1740 

GTCGTAGCAG TACGTAGGAG ATCCAATATG CCTGGCACAA GGGGGTGGGG TGGCCACAAA 1800 

CAGAAGCAGC CCTGTCCTGC CAAGTACACG CCTGCCTGCC ACGCACAATG GGAGACATTC I860 

CGCAAGTTCC ACGTGATGGC TCAGAAGAGG GGCCTGTCAG GAAGATGTAG GGGCCAGCAG 1920 

CCCCCGGCCG CGCCCCGCAA GGTGGCTGAC AGACGCCAGC AGCTGCCGGG GGCTCCGGGC 1980 

TGCTCCTGCT CCCAGGATGT GTATCTGACT GGAGTTTCTG GATTAAAGGC CAGTCGTGGC 2040 
TTCATTCCAC ATCCCTGGGT GCCCTTCGGC TCCTCCTAG 



Seq ID NO: 227 Protein sequence: 
Protein Accession #: XP_064321.1 

1 11 21 31 ' 41 51 

I I I I I I 

MVASSDQDRA PYLPGTLDKM PGPRLRSAQR PKAAQQEPGI EPGTYREGGG AIVLTYALGI 60 

GVGITGNTVQ QPPQLTDSAS IRQEDAFDNK IDIAEDGGQT PYEATLQQSF QYSPTTDLPP 120 

LTNGYLPSIS MYEIQTKYQS HNQYPNGNSK QKTTLNSRKP FPSTATTSVP QTVTPKKSGS 180 

PEVKLKITKT IQNGRELFKS SLCGDLLNEV QASEHTKSKH ESRKEKRKKP KKHDSSRSEE 240 

RKSHKIPKLE PEEQNRPNER VHTISEKPRE DPVLKEEAPV QPILSSVPTT EVSTGVKFQV 300 

GDLVWSKVTV TPCWVPRLRG RRSHHCSSCL EILVLVPALS LKRSFMVSSL KFLTSTGKQK 360 

PTFKGTAQMG WSPMASTTNV SLLLGHWEGT DQMSSRGPEF GGRRWVWQHQ KPQIRISICH 420 

RPGKEPLRLS FLRCEVERRI SSLATSQGCW CSPPDHVCEK CLEDYAGRRH LTLRAQEAFL 480 

GPDSRTGSLR AVGKRYCRNS QHQRYLLQGIj LGGFLEERNA NEYDCKLETR EAASSTPRIP 540 

YSPTHILQSE SAPNHYFPYH VSLSKFLKRK ANSHFLHLCA WAVRRRSNM PGTRGWGGHK 600 

QKQPCPAKYT PACHAQWETF RKFHVMAQKR GLSGRCRGQQ PPAAPRKVAD RRQQLPGAPG 660 
CSCSQDVYLT GVSGLKASRG FIPHPWVPFG SS 
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Seq ID NO: 228 DNA sequence 

Nucleic Acid Accession #: NM_006033 

Coding sequence: 253-1752 {underlined sequences correspond to start and stop codons) 



1 
I 

AGCAGCGAGT 
CCGTTGACAC 
CCTTCTCTGC 
GCTGGAAACA 
TGGCGGGGCA 
TTTGCTGCGG 
AAACCCAAAG 
AAGGACCCAG 
TGCAGTTTCA 
ATCTTTGAAA 
AATGTAGTTG 
AATACCAGGG 
GATTTTTCTC 
TATGCAGGCA 
CCCATGTTTG 
GATGTCCTCC 
GGCCACATTG 
GTCTTGGGAT 
GCCGTCCACC 
TGCACTGACT 
AATAGCATTG 
AAAACCCGGG 
AGTTACAAGA 
GCAGATTCCC 
ACCTTCCTGG 
GAGGGGGCCT 
CCCCGCAACC 
CAGCGGAAAC 
GAGCTCTGGT 
GTGGAGCTTC 
TATCCAAGCC 
AGCCTTGACC 
GGAGGGGACT 
TCTGTCCACA 
GCCTCTCGTG 
GAGGAACGCT 
GAGCCTCAGT 
ATCTTGCTCG 
TCTTAGCCAT 
TATTTTTCTG 
ATGTGGTAAT 
TAGACCCTCT 
CTGGGGGAGG 
CTTCTTAGTT 
GTGTTTGATC 
GCCGGTTGCC 
ATGACATATG 
TCTAAGATTT 
CTTTTCTTGT 
ACACAATATC 
GTAGGGGGAG 
TCAGCCACAT 
TTCCTGGAGG 
AATATGCCCC 
ATCCTAAGCC 
TTATGGCTGA 
TGTACTTAGG 
TTAAACTTCT 
GTGTGCTTCT 
TATCCCTTAT 
TTTGGAATGT 
AAAATTCATT 
TATTTTAAAT 
CATTTTTTGT 
TTTTTGGATT 
TCTGTTTTTA 



11 
I 

CCTTGCCTCC 
TCGCTCCCTG 
CTCCAGTCCC 
CCAAGAGGTG 
GGATGAGCAA 
GGAGCCCCGT 
CTACACAGAC 
AGCATGAAGG 
ACATGACAGC 
ACTGGCTGCA 
TGGTTGACTG 
TGGTGGGACA 
TCGGGAATGT 
ACTTCGTGAA 
AAGGGGCCGA 
ACACCTACAC 
ACATCTACCC 
CAATTGCATA 
TCTTTGTTGA 
CCAATCGCTT 
GCTACAATGC 
CAGGCATGCC 
ACATGGGAGA 
AGACTCTGCC 
TCTACACCGA 
CTCAGTCTTG 
CCGGACGGGA 
TGACATTTTG 
TTCGCAAGTG 
CCTGAGGGTG 
CATGGAGGAA 
CTGGAGCACT 
GCGCTGCTAT 
CCTCCAGAGC 
CACACTGGAT 
GGCTCCGAAG 
GAGAAGTCCT 
GGCCCTAGCT 
TCCGTCCTGC 
TTCATTTTTT 
GGACATATTA 
TTCTGTTTGG 
CTATAGGATA 
ATTATGTGCC 
ACTAGCAGAA 
AGATATAACT 
TAATACACAT 
AATACAGTGC 
TGAACTTCTT 
TGAGACACTT 
AAGGGCAACT 
GCCTAGACTT 
CAAAGTCTAT 
AGACGTGAGA 
CTTTTATTAA 
GATTCGGGAG 
TTTTCTAAGG 
GAAGACATAA 
CTTGTTTCTG 
CCAAAATGCT 
TTGCATATAC 
GATGTGTCAG 
AGTTGTGTAC 
CTTGTTGGTG 
AGGGTTGCTC 
CTAATGGAAG 



21 

I 

CGGCGGCTCA 
CCACCGCCCG 
CCAGCCCCTG 
GTTTTTGTTT 
CTCCGTTCCT 
ACCTTTTGGT 
TGAGGTCAAA 
ATGCTACCTC 
TAAAACCTTT 
CAAACTCGTG 
GCTCCCCCTG 
CAGCATTGCC 
CCACTTGATC 
AGGAACGGTG 
CATCCACAAG 
GCGTTCCTTC 
CAATGGGGGT 
TGGAACAATC 
CTCTCTGGTG 
CAAAAAGGGG 
CAAGAAAATG 
TTTCAGAGTT 
AATTGAGCCC 
ACTGGAAATA 
GGAGGACTTG 
GTACAACCTG 
GCTGAATATC 
TACAGAAGAC 
TCGGGATGGC 
CCCGGGCAAG 
AGTTACTGCT 
GGGAACAACT 
AGCTCTTGCT 
ACCAAGTCCA 
TGGTTTCTCA 
AGGCCCTGTG 
TCCGACAGGA 
GTTGGGGTTC 
TCCCCAGCTC 
AATTGAGCAA 
CTGAGCCTCT 
ATGGTGTATG 
TAAGCATTAG 
ACCTTCTTAG 
TAGCAAGCAG 
GCTTTGGAGC 
CTGTGTACAC 
TTTTTTTCCT 
GGAAAAGCCA 
ACACTTTTCA 
ATTATTATCC 
ATATACTAGT 
CTCTGAAACT 
CAAACAAGGA 
TGTATAACCA 
GAAGTGTGAC 
ACATTGTTTT 
CCAGTTGAGT 
TGATTGCTTT 
TGGAACCAGA 
ATAATGAGAT 
TTACACCTTA 
ATGAAGCATG 
CTCAAAAAGT 
AACCCATATT 
CTTTGCA 



31 
I 

GGACGAGGGC 
GGCTCCGTGC 
GCCGAGAGAA 
TTTAAAACTT 
CTGCTCTGTT 
CCAGAGGGAC 
CCATCTGTGA 
TCCGTCGGCC 
TTCATCATTC 
TCAGCCCTGC 
GCCCACCAGC 
AGGATGCTCG 
GGCTACAGCC 
GGCCGAATCA 
AGGCTCTCTC 
GGCTTGAGCA 
GACTTCCAGC 
ACAGAGGTGG 
AATCAGGACA 
ATCTGTCTGA 
AGGAACAAGA 
TACCATTATC 
ACCTTTTACG 
GTGGAGCGGA 
GGAGACCTCT 
TGGAAGGAGT 
AGGCGCATCC 
CCTGAGAACA 
TGGAGGATGA 
TCTTGCCAGC 
GAGGACCCAC 
GGTCTCCTGT 
GCCTCTCTTG 
GATTTGTGTG 
GTTGCTGGGC 
TAGAAGGCTG 
GCTGACTCAT 
TCATGGGTTG 
ACTCTCTGAA 
ATGTCTATTG 
CCATTTGGAA 
TGTATATGCA 
GGACCCTGAG 
TTATTATGTG 
AGTATCATTC 
AAATCTCTTC 
AGAAACCGGC 
CTTTGAAATA 
CCAATTCTAG 
AAAGATTTGT 
CTATTTTACA 
TAGTGGTGCA 
CCATGAAGAC 
CTTTTTTTTT 
GGAGAACATC 
ACCAAGCAGG 
AATCTGTATC 
CTTATTTCAA 
CTAGCCAAAG 
AGTGTTTCAA 
ATTTTGGGAA 
TCCACATAGC 
GTTTGTGGTA 
TTTGGATTTT 
ATTGGCTGTA 



41 
I 

AGATCTCGTT 
CGCCAAGTTT 
GGGTCTTACC 
CTGTTTCTTG 
TCTGGAGCCT 
GGCTGGAAGA 
GGTTTAACCT 
ACAGCCAGCC 
ACGGATGGAC 
ACACAAGAGA 
TTTACACGGA 
ACTGGCTGCA 
TCGGAGCGCA 
CAGGTTTGGA 
CGGACGATGC 
TTGGTATTCA 
CAGGCTGTGG 
TAAAATGTGA 
AGCCGAGTTT 
GCTGCCGCAA 
GGAACAGCAA 
AGATGAAAAT 
TCACCCTTTA 
TCGAGCAGAA 
TGAAGATCCA 
TTCGCAGCTA 
GGGTGAAGTC 
CCAGCATATC 
AAAACGAAAC 
AAGGCAGCAA 
CCAATGGAAG 
GATGGCTGGG 
AATAGCTCTA 
TAAGCAGCTG 
GAGCCTGTAC 
TCAGCTGCTC 
GTCAGGATGG 
CACTGACCAT 
GCACACATCA 
AACACTTAAA 
CCCAGTGGAG 
TGGGGAAAGG 
GCTTTAAGTG 
CCACCTCCCC 
ATGCTGGGGC 
TGTTTAGAGA 
ACCTGCCAGA 
TTTTACTTTA 
ATCTTGATTT 
GTATGCATTG 
AAACTGAGGC 
GCCAGGGAGA 
TTTTGCAGCC 
TATATAGAGC 
TGTGCCAACG 
AGAGGAAGAA 
GTGCCAAAGT 
GATATGTTCT 
CGAAGCTTGT 
ATTTTAGATT 
TAGGACCCGA 
CTGAGGGTAA 
ACTTATGTGA 
GGAGCATTTC 
CATCCTGGTC 
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CTGGGGCAAG 
TCATTTTCCA 
GGCCGGGATT 
GGAGGGGGTG 
CTGCTATTGC 
TAAGCTCCAC 
CCGCACCTCC 
CTTAGAAGAC 
GATGAGCGGT 
GAAAGACGCC 
TGCGGTCAAT 
GGAGAAGGAC 
CGTGGCCGGG 
TCCTGCCGGG 
AGATTTTGTG 
GATGCCTGTG 
ACTCAACGAT 
GCATGAGCGA 
TGCCTTCCAG 
GAACCGTTGT 
AATGTACCTA 
CCATGTCTTC 
TGGCACTAAT 
TGCCACCAAC 
GCTCACCTGG 
CCTGTCTCAA 
TGGGGAAACC 
CCCAGGCCGG 
CAGTCCCACT 
GACTTCCTGC 
GATTCTTCTC 
ACTCCTCGCG 
ACTCCAAACC 
GGTGCCTGGG 
TCTGCCTGAC 
AGCCTGCTTT 
CAGGCCTGGT 
ACTGCTTACG 
TTGGCTTTCC 
ATTAATTAGA 
TTGGGATTTC 
CACCTGGGGC 
GTTTCTATTT 
TATGAGTGAC 
CAGAATGATG 
GATAGAAGTT 
CAGAGCTGGT 
ATACCAGTGC 
GAATTAATAC 
CCTAATTAGA 
TTAGTGAGGT 
GGACTCAGAT 
AGTTCCCACC 
CATC CAT AAA 
GTTGGACTTT 
TGATTTTCTT 
TGTATCACTG 
CAAGCCAATT 
ACAGGTTGAG 
ATTTTCAGAT 
GCCTAAACAC 
TTTTATACGA 
GGGGTTTTCC 
GGATTTTGGA 
ACTTCTGACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
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Protein Accession #: NPJ>06024.1 

1 11 21 31 41 51 

I I I I I I 

MSNSVPLLCF WSLCYCFAAG SPVPPGPBGR hEDKLHKPKA TQTEVKPSVR PNLRTSKDPE 60 

HEGCYLSVGH SQPLEDCSFN MTAKTFFIIH GWTMSGIFEN WLHKLVSALH TREKDANVW 120 

VDWLPLAHQL YTDAVNNTRV VGHSIARMLD WLQEKDDFSL GNVHIjIGYSL GAHVAGYAGN 180 

FVKGTVGRIT GIiDPAGPMFE GADIHJCRLSP DDADFVDVLH TYTRSFGLSI GIQMPVGHID 240 

IYPNGGDFQP GCGLNDVLGS IAYGTITBW KCEHERAVHL FVDSLVNQDK PSFAFQCTDS 300 

NRFKKGICLS CRKNRCNSIG YNAKKMRNKR NSKMYLKTRA GMPFRVYHYQ MKIHVFSYKN 360 

MGBIEPTFYV TLYGTNADSQ TLPLEIVERI EQNATNTFLV YTEEDLGDLL KIQLTWBGAS 420 

QSWYNLWKEF RSYLSQPRNP GRELNIRRIR VKSGETQRKL TFCTEDPENT SISPGRELWF 480 
RKCRDGWRMK NETSPTVELP 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
5 application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 1 . A method of detecting an angiogenesis-associated transcript in a cell in 

2 a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridized to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-8. 

1 2. The method of claim 1 , wherein the biological sample is a tissue 

2 sample. 

1 3. The method of claim 1 , wherein the biological sample comprises 

2 isolated nucleic acids. 

1 4. The method of claim 3, wherein the nucleic acids are mRNA, 

1 5. The method of claim 3, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 6. The method of claim 1 , wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1 -8 . 

1 7. The method of claim 1 , wherein the polynucleotide is labeled. 

1 8. The method of claim 7, wherein the label is a fluorescent label. 

1 9. The method of claim 1 , wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 1 0. The method of claim I , wherein the patient is undergoing a therapeutic 

2 regimen to treat a disease associated with angiongenesis. 

1 11. The method of claim 1, wherein the patient is suspected of having 

2 cancer. 

1 1 2. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1 -8. 

1 13. The nucleic acid molecule of claim 1 2, which is labeled. 

1 1 4. The nucleic acid of claim 13, wherein the label is a fluorescent label 
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1 1 5. An expression vector comprising the nucleic acid of claim 12. 

1 1 6. A host cell comprising the expression vector of claim 15. 

1 1 7. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-8 

1 1 8. An antibody that specifically binds a polypeptide of claim 17. 

.1 1 9. The antibody of claim 18, further conjugated or fused to an effector 

2 component. 

1 20. The antibody of claim 19, wherein the effector component is a 

2 fluorescent label. 

1 21. The antibody of claim 1 9, wherein the effector component is a 

2 radioisotope. 

1 22. The antibody of claim 1 9, which is an antibody fragment. 

1 23. The antibody of claim 1 9, which is a humanized antibody 

1 24. A method of detecting a cell undergoing angiogenesis in a biological 

2 sample from a patient, the method comprising contacting the biological sample with an 

3 antibody of claim 18. 

1 25. The method of claim 24, wherein the antibody is further conjugated or 

2 fused to an effector component. 

1 26. The method of claim 25, wherein the effector component is a 

2 fluorescent label. 

1 27. The method of detecting antibodies specific to angiogenesis in a 

2 patient, the method comprising contacting a biological sample from the patient with a 

3 polypeptide which is encoded by a nucleotide sequence of Tables 1-8. 
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